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MESOSCOPIC HIGHER REGULARITY AND 
SUBADDITIVITY IN ELLIPTIC HOMOGENIZATION 

SCOTT ARMSTRONG, TUOMO KUUSI, AND JEAN-CHRISTOPHE MOURRAT 


Abstract. We introduce a new method for obtaining quantitative results 
in stochastic homogenization for linear elliptic equations in divergence form. 
Unlike previous works on the topic, our method does not use concentration 
inequalities (such as Poincare or logarithmic Sobolev inequalities in the prob¬ 
ability space) and relies instead on a higher (C , k> 1) regularity theory for 
solutions of the heterogeneous equation, which is valid on length scales larger 
than a certain specified mesoscopic scale. This regularity theory, which is of 
independent interest, allows us to, in effect, localize the dependence of the 
solutions on the coefficients and thereby accelerate the rate of convergence 
of the expected energy of the cell problem by a bootstrap argument. The 
fluctuations of the energy are then tightly controlled using subadditivity. 
The convergence of the energy gives control of the scaling of the spatial 
averages of gradients and fluxes (that is, it quantifies the weak convergence 
of these quantities) which yields, by a new “multiscale” Poincare inequality, 
quantitative estimates on the sublinearity of the corrector. 


1. Introduction 

In this paper, we introduce a new method for obtaining quantitative results 
in elliptic homogenization. It is based on a new regularity theory for higher 
derivatives of solutions which is valid on mesoscopic scales. The regularity es¬ 
timates essentially localize the dependence of the solutions on the coefficients, 
allowing for the mixing assumptions to dictate the rate of homogenization. 
This idea is formalized by a novel bootstrap argument which uses the regu¬ 
larity estimates to accelerate the convergence of the natural subadditive and 
superadditive quantities associated to the variational formulation of the equa¬ 
tion. As an application, we give explicit estimates on the sublinearity of the 
correctors and the weak convergence of their rescaled gradients. 

1.1. Motivation and informal summary of results. We consider the linear 
elliptic equation 

(1.1) - V • (a(r)V«) = 0 

in bounded open subsets of MA The coefficient a(-) is a random field valued 
in the set of d-hy-d symmetric matrices with eigenvalues belonging to the in¬ 
terval [1,A] for a fixed ellipticity constant A > 1. The law of a(-) is given 
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by a probability measure P which is assumed to satisfy a quantitative mixing 
condition. 

We are interested in obtaining quantitative information on the statistical 
properties of solutions of (1.1) on very large scales, a concern which lies within 
the realm of homogenization. The qualitative theory of homogenization for 
such equations was developed by [21, 18, 23, 17] and the first quantitative 
results are due Yurinskii [24]. Developing a quantitative theory of stochastic 
homogenization for linear elliptic equations has received a lot of attention since 
the groundbreaking work of Gloria and Otto [13, 14, 15] and Gloria, Neukamm 
and Otto [10, 11], who proved an array of optimal estimates under certain very 
strong mixing assumptions on the coefficients (in particular, that P satisfies 
some form of a spectral gap inequality). At the core of their results are the 
use of sensitivity estimates and the spectral gap assumption to derive moment 
bounds on the gradient of the corrector and the Green’s functions, which cul¬ 
minated in the work of Marahrens and Otto [19]. These gradient estimates, 
following Naddaf and Spencer [20], give control of the sensitivity of the correc¬ 
tors themselves to changes in the coefficient field, and thereby yield optimal 
estimates after another application of the spectral gap inequality. 

A regularity theory for stochastic homogenization was recently introduced by 
Armstrong and Smart [4]. In particular, [4, Theorem 1.2] provides a gradient 
bound, of exactly the sort required for the quantitative theory of homogeniza¬ 
tion, which applies to arbitrary solutions of (1.1) and with much stronger (and 
essentially optimal) stochastic integrability. The techniques of [4] were further 
developed by Armstrong and Mourrat [2], for general (nonlinear) divergence- 
form equations and systems, and, in a similar spirit by Gloria, Neukamm and 
Otto [12] for linear equations and systems. These works were inspired by the 
celebrated papers of Avcllaneda and Lin [5, 6], who proved in particular uni¬ 
form Lipschitz estimates for equations with periodic coefficients. As in [5, 6], 
the philosophy is to show that solutions of the heterogeneous equation (1.1) in¬ 
herit the regularity of the limiting constant-coefficient equation due to homog¬ 
enization. The method of [4] is however different from these previous works, 
which were based on compactness arguments and used strong bounds on the 
correctors. Instead, the idea is to mimic more closely the proof of the classical 
Schauder estimates, using quantitative methods to replace compactness: one 
shows that, at each scale, a solution of (1.1) may be approximated by the solu¬ 
tion of the homogenized equation and then iterates the resulting estimate over 
dyadic scales. If the error in the approximation is small enough (an algebraic 
or Dini rate of homogenization is needed), then the argument yields a uniform 
Lipschitz estimate. This argument is more robust than the compactness argu¬ 
ment of [5, 6], since it separates the approximation step from the iteration step. 
Indeed, in addition to its applicability in the stochastic setting, it has yielded 
new results even in the periodic and almost periodic settings (cf. [3, 22]). 

Another idea from [4, 2] is that subadditive arguments are the way to prove 
quantitative homogenization results that are optimal in stochastic integrability 
(under essentially any mixing assumption). This leads to a regularity theory 
which is optimal in terms stochastic integrability. Subadditive arguments are 
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so effective for this purpose because they reduce questions of stochastic inte- 
grability for random variables with complicated dependence on the random 
environment (such as solutions of the PDE) to estimates of a finite sum of 
bounded random variables which satisfy the same mixing properties as the 
coefficients. Under assumptions on the coefficient held which provide a spec¬ 
tral gap inequality, the regularity theory (specifically the Lipschitz estimate) is 
strong enough to quickly recover the optimal quantitative estimates on the cor¬ 
rector proved by Gloria and Otto. Thus an important consequence of these new 
regularity estimates on the quantitative theory was to separate the gradient es¬ 
timates, which can now be proved under very general mixing assumptions, from 
the rest of the quantitative theory, which until now requires spectral gap-type 
assumptions. 

In the present paper, we propose to take this program one step further by de¬ 
riving quantitative bounds for the correctors (with explicit exponents) without 
using the spectral gap or other Poincare-type concentration inequalities. This 
is the first step in obtaining a quantitative theory of stochastic homogenization 
applicable to general coefficient fields. Instead of relying on concentration of 
measure, we show by a bootstrap argument that the regularity theory itself 
can be used to improve the rate of homogenization. In addition to providing a 
new point of view in the quantitative theory of stochastic homogenization and 
allowing for more general mixing assumptions, our arguments yield estimates 
which are much stronger in stochastic integrability (under strong mixing as¬ 
sumptions, we get exponential moments rather than just pth moments). This 
is because, as in [4, 2], our method allows us to use subadditivity to control the 
stochastic fluctuations. So far, the estimates we can obtain (see Theorem 1.2, 
below) are unfortunately suboptimal in their scalings compared to what can be 
proved under spectral gap assumptions. This is due to the presence of bound¬ 
ary layers encountered in the analysis which eventually force the bootstrap to 
halt before desired. We hope to address this issue in the near future. 

In addition to the uniform Lipschitz estimate, our approach requires higher 
C k 4 estimates for k > 1, which were left essentially implicit in [4, 2], Of course, 
unlike the Lipschitz estimate, such higher derivative estimates for k > 1 cannot 
hold uniformly on the unit scale, but they are valid on mesoscopic scales: the 
assertion, which is stated precisely in Theorem 2.1 below, is roughly that a 
solution of (1.1) on the ball Br, with R » 1, can be well-appoximated by 
a A;th degree polynomial on any mesoscopic ball B r (x ) £ -Br /2 with radius 
r > i? 1-e , for a specified exponent e > 0 which depends in particular on k. By 
“well-approximated”, we mean that the quality of the approximation of the 
solution by the polynomial is, up to a constant, as good as one would have for 
a harmonic function with the same oscillation. This result is presented below 
in Theorem 2.1. Fischer and Otto [9] have very recently developed a higher 
regularity theory along somewhat different lines (see also Remark 2.3). 

While we state and prove our results for scalar equations with symmetric 
coefficients under the assumption of finite range of dependence, we emphasize 
that these choices are not imposed on us by any limitation of our method. 
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Indeed, the arguments work verbatim for systems with just changes to the no¬ 
tation, the techniques of [2] can be used to get rid of the symmetry assumption, 
and it is precisely under weaker mixing conditions that our estimates would 
be optimal, since the weak mixing stops the bootstrap before it sees boundary 
layers. Our choice to make these additional assumptions rather reflects a de¬ 
sire to maintain the readability of the paper by presenting the ideas in their 
simplest setting. 


1.2. Assumptions. We work in the Euclidean space WL d in dimension d > 2 
and with a fixed ellipticity constant A > 1. We consider the space of coefficient 
fields a(-) valued in the symmetric d-bj-d matrices satisfying, for all £ e R d , 

(1.2) |£| 2 <£-a(x)£<A|£| 2 . 

We define 12 to be the set of all such coefficient holds: 

12 := ja(-) : a : R d -*■ M. dxd is Lebesgue measurable, satisfies (1.2) and a 4 = a} . 
We endow 12 with the translation group {T y } yeR d, which acts on 12 via 

(T y a)(x) :=a (x + y), 

and the family {X{U)} of a-algebras on 12, with X{U) defined for each Borel 
subset U c R d by 


T(U) := o algebra on 11 generated by the family of maps 

a f q • a(x)pip(x) dx, p,qeR d , ipeC^iW 1 ). 

Ju 

Roughly, X(U) contains the information about the coefficients restricted to U. 
We denote the largest of these cr-algebras by T := J r (R d ). The translation 
group may be naturally extended to T itself by defining 


T y A := {T y Si : a e A} , A e T 

and to any random element X by setting (T z X)(a) := X{T Z a). 

Throughout the paper, we consider a probability measure P on which 

is assumed to satisfy the following two conditions: 

(PI) P is stationary with respect to ^-translations: for every z e Z d and 

AeP, 

P [A] = P [T Z A]. 

(P2) P has a unit range of dependence: for every pair of Borel subsets U, V £ 
with dist(f/, V) > 1, 

T{U) and T{V) are P-independent. 

The expectation of an JP-measurable random variable X with respect to P 
is denoted by E[A], 
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1.3. Notation. We continue with some notation used throughout the paper. 
For a measurable set E c we denote the Lebesgue measure of E by \E\ 
unless E is a finite set, in which case \E\ denotes the cardinality of E. For a 
bounded Lipschitz domain [/cl d with \U\ < oo and p e [1, oo), we denote the 
normalized LP(U ) norm of a function / e L p {U) by 

1 

(!-3) WfWmu) '■= (^l/( x )l P dx ) P ■ 

It is also convenient to denote |/|l°°(c/) := ||/||l~(u)- For a vector-valued F e 
L p (U ; M d ), we write := |||i ? |||L p (c/)- The average of a function / e L l (U) 

on U is denoted by 

U)u £f(x)dx. 

For [/cl d with \U\ < oo and vector fields F,Geh 2 ((/;M d ), we denote 
(1.4) (F,G)„:= £ F(x) ■ G(x ) dx. 

Since we also wish to quantify the convergence of various functions in the 
weak L p topology, it is natural to work with W~ 1,p norms. For this purpose we 
introduce the normalized W~ 1,p norm of F e L p ([/;R d ) by 

\\ F \\w~^(u) :=sup{|(F,r?) lf | = r|eW /1/ (h;l‘ i ), (v)u = °> I|V?7(X)|| u>\u) = l} ■ 

As usual, p' denotes the Holder conjugate of an exponent p e [l,oo]. We also 
use the shorthand notation 

(1-5) II-F|£-1(17) II-FIIW- 1 " 2 (I/)- 

Note that W~ 1,p ' and H~ l are (somewhat unconventionally) used to denote the 
duals of VF 1>P /M and AF 1 /M., rather than Wq’ p and H£ We denote cubes of side 
length R > 0 by 

□* = □*(()) , □fl(x):=x + nfl. 

The family of cubes of side length at least one is 

(1.6) C :={a R (x) : xeR d , R> l} . 

The set of (real-valued) polynomials on with degree at most k e N is denoted 
by Vk- The (random) vector space of solutions of (1.1) in U c ]R d is 

(1.7) A(U) := |u e Hl oc (U) : Vu € Hq(U), j X7v(x) ■ a(x)\7u(x) dx - o|. 

Recall that, for each p e M d , the corrector $(-,p) is defined for P-almost ev¬ 
ery a e fl as the unique function (up to a constant) in H^ oc ( M d ) with a 7L d - 
stationary, mean-zero gradient and which satisfies the equation 

-V • (a(x) (p + V$(-,p))) = 0 in R d . 


See [17] for details. 
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1.4. Subadditive quantities and main results. We now recall some objects 
from [4, 2] which play a central role in the paper. Given a bounded Lipschitz 
domain U £ and p,q e we denote the linear function of slope p by 
£ p (x) := p ■ x and define 



which is the natural subadditive quantity representing the energy of the solution 
of the “cell problem,” introduced (in a more general form) by Dal Maso and 
Modica [7, 8] in their proof of qualitative homogenization of convex integral 
functionals. We also define 



which is the natural superadditive quantity introduced in [4] (also in a more 
general form), and is dual to v(U,p). 

The quantity u(-,p) is subadditive in the sense that if U, U\, ..., Uk are 
bounded domains satisfying 

Ui, ..., Uk are pairwise disjoint and | U \ {U\ u u C4)| = 0, 

then 



( 1 . 8 ) 


This is immediate from the fact that a candidate minimizer for u(U,p) can be 
obtained by assembling the minimizers of the quantities v(Ui,p), as these agree 
on the boundaries. (We remark that we are using the term “subadditive” in a 
nonconventional way, due to our normalization, as it would be more standard 
to say that U \U\u(U,p) is subadditive.) Conversely, the quantity p(-,q) is 
superadditive (that is, -p(-,q) is subadditive), since the minimizer for / i(U,q ) 
provides with a candidate minimizer for each p(Ui,q) by restriction. 

Note that the minimizer for p(U,q) will be the solution of a Neumann prob¬ 
lem for (1.1) in U, while of course the minimizer for v(U,p) is the solution of a 
Dirichlet problem. Thus we may think of u(U,p) as “the subadditive quantity 
for gradients” and of p(U,q) as “the superadditive quantity for fluxes.” 

The homogenized matrix a can be defined as the symmetric matrix satisfying 


xP' a P = ! ini E[i/(D R ,p)] 

Z ix—>oo 


(1.9) 


We view the quantities n and v as central to the quantitative theory of homog¬ 
enization. The main step in the quantitative arguments in [4, 2] is to show 
roughly that there exists a(d, A) > 0 such that, for every □ e C, 

| u(\J,p) - i p ■ ap\ < C\p\ 2 [□[ a with overwhelming probability, 

and a similar estimate for p, 

|/x(D, q) + ~q ■ a' 1 (/| < Cjg| 2 |U| ° with overwhelming probability. 

The first main result of this paper improves the rate of convergence from the 
unspecified and tiny a(d,A) > 0 in [4] to any exponent ot<\- 
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Theorem 1.1 (Rate of convergence of subadditive quantities). Let a < 
There exists C(d, A, a) < oo such that for every p,q e □ e C and A e M, 

f logE [exp (A|p|~ 2 |u|“ \v(\3,p) - \p ■ ap|)] < C(1 + A 2 ), and 
^ ^ { logE [exp (A|g|“ 2 |n| a !//(□, 5 ) + ig-a _ 1 g|)] ^(l + A 2 ). 

Testing the definition of p{U,q) with the minimizer of u(U,p) yields 

(1.11) J(U,p,q ) :=v(U,p)-fi(U,q)-p-q>0. 

Moreover, as we will see, J(U,p,q ) can be expressed in a variational form, and 
the optimizer is precisely the difference of the minimizers of p(U, q) and u(U,p). 
In other words, the difference of the subadditive energies is an energy of the 
differences of the minimizers. The proof of Theorem 1.1 is based on a bootstrap 
argument, using the higher regularity estimates, applied to this difference, to 
accelerate the convergence of J(\J,p, &p) to zero. See Section 4.1 for a sketch 
of the bootstrap argument. 

The estimate (1.10) is optimal in the sense that it is false for any a > To see 
this, we note that minimizers for p satisfy (oscillating) Neumann conditions 
while those of u satisfy Dirichlet conditions. Therefore, the gradient of the 
difference of the minimizers of /i(D,a p) and z/(D ,p) will in general be 0 ( 1 ) in 
a boundary layer of at least unit thickness. The proportion of volume of such 
a boundary layer relative to the whole cube □ is of order |lU| -1 / d . Therefore 
J(U,p, ap) should be at least c|lH| _1 / d in general. 

It is the presence of this boundary layer that so far limits our quantitative 
results to consequences of (1.10). On the other hand, one expects the mini¬ 
mizers of u(U,p) and p(U, ap) to be much closer in the interior of U than near 
the boundary, and so if the boundary layer could be neglected, there is hope 
to prove much more precise results. 

Theorem 1.1 can be used to prove an array of quantitative estimates in ho¬ 
mogenization with exponents which, under the strongest mixing assumptions, 
will typically differ from the optimal one by a square root. For example, one 
can show that the L 2 error in homogenization is 0 (e 1/,2_ ), where the micro¬ 
scopic length scale is £, or that the // 1 norm of the two-scale expansion is 
C^e 1 / 4- ). The application we present here is an estimate of the sublinearity of 
the corrector and of the H ~ 4 norm of its gradient (recall that the H~ l norm 
measures weak convergence in L 2 ). It roughly states that 

with very strong stochastic integrability. Note that estimates on the sublin¬ 
earity of the corrector are intimately connected to estimates for the error in 
homogenization. 

Theorem 1.2 (Sublinear growth of the corrector). Let (3 e (0, |). There exists 
a constant C(d,A,f3) < 00 such that, for every p e R d , R> 1 and AeR, 

logE[exp [\\p\- 2 R- 2 ^ (||$(-, p) - {<&{■,p)) Br f L 2 {BR) + |V$(-,p)|| ; -i (S h) )) 

<C{ 1 + A 2 ). 
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Theorem 1.2 is new, even though it is almost certainly suboptimal and does 
not compare favorably with the results of Gloria and Otto [15] under spectral 
gap assumptions, who proved that correctors are almost bounded (see also Glo¬ 
ria, Neukamm and Otto [12] for more results under spectral gap assumptions). 
Indeed, it seems to be an open question (and there is some doubt) whether 
finite range of dependence implies a spectral gap-type inequality in d> 1. The 
best previous result for finite range of dependence was for /3(d, A) > 0 very 
small (cf. [4]). Moreover, as mentioned above, our methods are applicable un¬ 
der essentially any mixing condition and we believe yield essentially optimal 
estimates under weaker mixing conditions (such as mixing conditions so slow 
that spatial averages of the coefficients on scale R converge slower than R _1 to 
their mean). This will be explained in more details in future papers. 

The novelty of Theorem 1.2 is not however in its statement, but in its proof. 


1.5. Outline of the paper. Our results rely crucially on C ktl estimates for 
solutions of (1.1) on mesoscopic scales. This was essentially proved in [4], where 
the higher regularity theory in stochastic homogenization was introduced; the 
precise statements we require are proved in Section 2. In Section 3, we make 
the fundamental observation that the quantity J in (1.11) can be expressed as 
a “modulated energy” of the difference of the minimizers of p and v. 

We break the proof of Theorem 1.1 into two main steps. First, in Section 4, 
we use an induction argument on the exponent a to show that, for every a < A 
p e and cube □ e C, 


( 1 . 12 ) 




< c'lpi 2 !□! 


and a similar estimate with p in place of u. This statement is a priori much 
weaker than the conclusion of Theorem 1.1, but we show in Section 5 that the 
subadditivity of -p and v can be used to upgrade the stochastic integrability 
from L 1 to exponential moments without sacrificing any of the exponent. 

Finally, in Section 6, we present a functional inequality, which appears to be 
new and which we call multiscale Poincare inequality. It has a wavelet flavor 
and converts control over the spatial averages of the gradient of a function in 
triadic subcubes into an estimate on the oscillation of the function itself. This, 
together with the regularity theory, reduces Theorem 1.2 to Theorem 1.1. 


2. Higher regularity on mesoscopic scales 

A cornerstone of the methods in this paper is a quenched C k P estimate for 
solutions of (1.1) on mesoscopic scales. This was essentially proved in [4], where 
the higher regularity theory in stochastic homogenization was introduced. 

The rough statement, which is given precisely in the following theorem, as¬ 
serts that a solution of (1.1) can be approximated, just as well as a harmonic 
function, by kth degree harmonic polynomials - on length scales larger than a 
fixed mesoscopic scale (or the microscopic scale, if k = 0). We let Ak denote 
the set of polynomials of degree at most k which are a-harmonic. 
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Theorem 2.1 (C*^ 1 regularity on mesoscopic scales). Fix s e (0,d) and k e N. 
There exist an exponent 5(s,d, A) > 0, a constant C(s,k,d, A) < oo and an 
T-measurable random variable X : 12 -*■ [l,oo) satisfying the estimate 

(2.1) E [exp (A 5 )] < oo 

and such that, for every R > 2X, v e A{B R ) and r e jx v Rm ; lR ; 

(2.2) inf 

weAk 

Theorem 2.1 is essentially proved in [4], That paper only stated the (uni¬ 
form) C 0 ’ 1 estimate and left (mesoscopic) higher regularity statements for k > 1 
implicit in the proof. To prove the proposition, we just need to expound the 
argument from [4]. 

A second ingredient in the proof of Theorem 2.1, which also plays a key role 
in the bootstrap argument, is the following quenched error estimate for the 
Dirichlet problems proved in [4]. It gives a quenched, deterministic estimate 
for the error in homogenization for the Dirichlet problem on length scales larger 
than a certain random scale of characteristic size 0(1). 

Proposition 2.2 ([4, Theorem 1.1]). Fix s e (0 ,d), e > 0, and let U <= Bi 
be a Lipschitz domain. There exists an exponent 5(d,A,s,e) > 0, a constant 
C(s,d,A,U)<oo and an T-measurable random variable 1Z : -*■ [l,oo), which 
depends only on (d,A,s,e) and satisfies the estimate 

(2.3) E [exp (7£ s )] < oo, 

such that, for every r > 1Z, f e IT 1 > 2+e (rl7) and solutions u,u e f + H^rU) of 
-V • (aVu) = 0 and - V • (a Vu) = 0 in rU, 
we have the estimate 

(2-4) “ \\ u - u \l?(rU) - r 5 l^/Izd+qrU) • 

Remark 2.3. The mesoscopic regularity estimate in Theorem 2.1 implies Li- 
ouville theorems of all orders. These state roughly that, for each k e N, the 
subspace of *4(M d ) consisting of functions which grow at most like o(|x| fc+1 ) 
has the same dimension as Ak- The latter functions correspond precisely to 
the kth order correctors. Note that, in view of the results of [2], Theorem 2.1 
and therefore these Liouvillc theorems hold in much greater generality than we 
present here (e.g., for nonlinear equations and with weaker mixing conditions). 
A similar result was recently proved by Fischer and Otto [9], who developed a 
similar higher regularity theory in the general stationary ergodic setting. Sim¬ 
ilar to [11], they proceed a bit differently: rather than measure the distance of 
an element of A(B R ) to Ak , they measure its distance to the kth order correc¬ 
tors. We believe that while both approaches are of interest, the one presented 
here is more faithful to what a C kjl estimate should be in this context (we 
would like, for example, to measure the C k 4 seminorm of a jth order corrector, 
for j < k, and not get zero, otherwise we gain no information on the correctors 
themselves). See below Remark 4.5 in [3] for some similar comments. 


It w \\l 2 ( l b t ) 


“ii 


\ V \\l 2 (B r ) 
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That a quantitative estimate for the error in homogenization for the Dirichlet 
problem implies higher regularity estimates via a Campanato iteration was an 
idea introduced in [4], Here we formulate a version in the following lemma, 
which is a variation of [4, Lemma 5.1]. 

Lemma 2.4. Fix R > 2, a > 0, p e [1, oo] and u e L p (Br). For each k e N and 
s e (0, R], denote 

D k (s):= inf \\u-w\\ LP(Bs) 

wcAk 

Assume that h e [l, and have the property that, for every r e [h, \R\, there 
exists an a-harmonic function v e C°°(B r ) such that 

(2.5) 1 u - v\\ mBr) < r~ a D 0 (2r). 

Then, for each k e N, there exists a constant C(d,A,k,a) < oo such that, for 
every r € [M4 

(2.6) D*(r) < C (H D k (R) + Cr- 

Proof. Fix k e N. Throughout, we denote by C and c positive constants de¬ 
pending only on (k,a, A) which may vary in each occurrence. In the first few 
steps, we make some preliminary observations and introduce the notation we 
need in the main part of the argument, which begins in Step 3. 

Step 1. We first observe that the triangle inequality and the hypothesis (2.5) 
imply that, for every r e [h, ^i?] and s e ( 0 , \r\, 

(2.7) D k (s) < C U) k+1 D k (r ) + C r~ a D 0 (r). 

Selecting a harmonic function v e C°°(B r ) to satisfy (2.5), we find that 
D k (s)= inf \\u-w\\^ Ba) 

wtAk 

< inf \\v-w\\ LP(Bs) + \\u-v\\ mBa) 

wzAk 

In the last line, we used the fact that any a-harmonic function v satisfies, for 
every 0 < s < \r, 

( s\ k+1 

inf \\v - w|| iTO(Bs) < C - inf |v - w\\ LP { Br ) ■ 

wsAk \ ' / wzAk 

Next we use the triangle inequality and (2.5) a second time to get 

inf I \ v - w \\ L v(Br) - D k {r) + Cr- a D Q {r). 
weAk 
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Substituting into the inequality above, we get 


Di 


(s) < C ^ 1 ( D k (r ) + r~ a D 0 (r)) + C r~ a D 0 (r) 


iC (?) 


k +1 


© 


D k (r) + C - r-“h 0 (r) 


Step 2. We set up the rest of the argument. Fix 9 = 0(A) > c so small that 
CO = |, where C is the constant in (2.7), so that the latter inequality implies 

D k (Or)<^D k (r) + Cr- k ~ a D 0 (r), 

where here and in what follows we set D k (s ) := s~ k D k (s). An iteration of the 
previous inequality gives 

m —1 , 

(2.8) D k (O m r ) < 2 ~ m D k (r) + 2 j ~ m (6 j r y D 0 ( 9 j r ) 

3=0 

provided 0 m_1 r > h. To shorten the notation, we denote rj := O^R, Bi := B r . 
and by w k j the best Ath degree polynomial approximation of u in BR that is, 
w k ,j satisfies 

\\ U ~ w k,j\jjP(Bi) = H u ~ w \\l p (b3) 

Step 3. We now complete the proof of the proposition in the case A = 0. In 
fact, this has been already proved in [4, Lemma 5.1], but we give the argument 
for the sake of completeness. 

First, since we may add constants to both u and v, we may assume without 
loss of generality that Wo,o = 0. It follows that 

1^1,01 < 2Dq(R), 

and hence we easily get 

(2.9) 1^1,01z,~(b h ) + R II Vut )0 || l°°(b r ) - CD 0 (R). 

Using (2.8) we obtain 

X ~ _ d ~ 

— 11^1,m+1 — ^l,m|| LP(B rn+1 ) ~ ^lv wi+l) @ p D\ (Vm) 

^m +1 

m 

< 2- m UD 1 (A) + C £ 2 j - m ry-°Do (r,). 
j =o 

We thus deduce that 

—' m D (r ) 

IVw lim+1 - Viui, m | < 2 - m CD x {R) + U £ 2- 7 _m r, 

3 =0 r 3 

and it follows by summation that 

n D (v } 

IViWi.n - Vuq,o| < CD^R) + C £ r] a -^- 

3=0 r 3 
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provided that r n > h. Combining this with (2.8) and (2.9) leads to 


D 0 (r n ) 


< Di(r n ) + |Vwi, n | < C 


Do(R) 

R 


Taking supremnm then gives 

D 0 (r m ) D 0 (R) 


sup 

me{0,...,n} 


<c- 


R 


cE 


72-1 

cE' 

j=o 


sup 


,-a D 0 ( r i) 


Do m) 


j =0 me{0,...,n} r ri 


provided that r n > h. Letting now n* = n*(d,a) be the largest integer such 
that C T,'j=o r~ a < \ and r n * > h we obtain after straightforward manipulations 
that 


( 2 . 10 ) 


sup 

h<r<R 


D 0 (r) 


<C 


Do(R) 

R 


Step 4■ We complete the argument in the case of general k e N. Following the 
reasoning of the previous step, letting w k +i,j stand for the k th order polynomial 
part of Wk+i,j, we get, by (2.8) and (2.10), 

(2.11) D k (rj) < — ||u - 

3 

— l ( c ) ^ k |^fc+lj — ^k+l,j | ) 

T 3 

< r 3 (2 ~W k+1 (R) + Cr~ k - a ^jP- + |v fe+1 w fe+ ij|) . 

Thus we are left to estimate \\/ k+1 w k +i,j\. Since 

\\^k+i,j+i — ^Tfe+ij+i I lp(b3 +1 ) — ^ (-^fc+i(c+i) ^k+i (c)) 

r i+i 

< C2~W k+1 (R) + 

we deduce that 

( 2 . 12 ) |v fc+ v +lj+1 - v fc+ V +1J | < C2-^ fc+1 (i2) + cv- fc -^4r^- 

it 

Indeed, for any n e N and polynomial 0 e V n , we have that 

1 1 

(2.13) ( -f \4>(x)\ p dxY - ( -f \cj)(rx)\ p dxY > csup \4>(rx)\ > cr n |V n 0| 

\jB r / \JBi ) xeBi 

for a constant c = c(d,n) > 0. This is due to scaling and Lemma 2.6, which is 
stated and proved below. 

Now (2.12) implies after summation that 

W k+1 w k+1 , - V fc+ V + i,o| < CD k+1 (R) + Cr- k - a ^j^. 

Furthemore, since 

||^/c,0 — ^fc+1,01 LP(Br) ~ Dk+1 (^) Dfc^tV) < 2_Dfc(.R) ? 
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we obtain 

\v k+1 w k+ i,o\<CR^ k - l D k (R), 

and consequently 

\w k+1 w k+ ij\ <\w k+1 w k+ i,j ~ V fc+ V + i,o| + |v fc+1 in fc+ i i0 | 


-fc-l n / i ^0 (^) 


< CD k (R) + Cr~ R . 

Combining this finally with (2.11) allows us to conclude with 

' r j\ D 0 (R) 


D k (r,) < c(g)“ D t (R) + Crj«(|) 

from which the statement (2.6) can be easily deduced. □ 

Remark 2.5. Notice that since D k (R ) < H 0 (i?), the estimate (2.6) implies 

Ok M < C (L)‘“ B„(B) + Cr- J (T) C 0 (fl). 

This is a C k ’ 1 estimate on scales r for which the second term on the right side 
is smaller than the first term, that is, for r satisfying 


R 


r > Rk+s. 


In the proof of Lemma 2.4, we used the following fact, which is a simple 
consequence of the equivalence of norms in finite-dimensional vector spaces. 

Lemma 2.6. Fix k e N. There exists a constant C(d,k ) < oo such that, for 
every m < k and w e V k , 

U 'IIl~(Si) - C ll W IIl^Bi) • 

Proof of Theorem 2.1. The proof is essentially the same as - indeed, even con¬ 
siderably simpler than (due to homogeneity) - the argument given in the gen¬ 
eral nonlinear case considered in [4, 2], If we take U - B\ and e(d, A) > 0 to 
be the exponent in the interior Meyers estimate (cf. [2, Proposition B.6] for 
instance), then the solvability of the Dirichlet problem and the conclusion of 
Proposition 2.2, with 1Z given there, ensures that the hypothesis of Lemma 2.4 
is satisfied for h-lZ. The lemma then yields the result for X -1Z. □ 


3. Properties of the modulated energy J 

We begin this section with the simple but key observation that the difference 
between v and fa can be expressed as a “modulated energy” of the difference 
of the minimizers. We define J(-,U,p,q) for functions w € H l {U) by 

J(w,U,p,q ) := /.(- -SJw{x) ■ a (x)Vw(x) - p ■ a(x)Vw(x) + q ■ SJw{x) j dx. 

We denote the maximum of J(-,U,p,q) among solutions of the PDE by 

J(U,p,q) ■= max J(w,U,p,q). 

wtA(U) 

To motivate the definition of J(U,p,q), we show that it is actually a familiar 
object: it can be decomposed into the quantities studied in [4], which are the 
focus of the analysis in that paper. 
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Lemma 3.1. For every bounded Lipschitz domain U £ M, d and p,q e R d , 

(3.1) J(U,p,q ) = v(U,p)-p(U,q)-p-q. 

Proof. Fix p,q e M d , u e A(U) and let v e i p + Hq(U) be the minimizer in the 
definition of v(U,p). Note that v e A(U) and compute 


(U,p) - -f - (Vu(x) ■ a (x)Vu(x) - q ■ Vm(i)) dx - p - q 

(/ \ 2 2 ) 

- i (V^(x) - V^(x)) • a(x) (V^i(x) - V^(x)) 

- p • a(x) (V'u(x) - Vu(x)) + g • (Vm(i) - Vu(x)) j dx 
= J(u-v,U,p,q). 

Here we used integration by parts twice, both taking advantage of the affine 
boundary condition for v to get 


P = 


£ X7v(x) 


dx 


and another, which also uses that u - v e A{U) to get 

£ p ■ a(x) (Vm(x) - Vv(x)) dx - £ Wv(x) ■ a(x) (Vu(r) - Vn(x)) dx. 


We deduce that 


J(U,p,q)= max J(w, U,p,q) 

waA(U) 

= max J(u-v,U,p,q) 

utA(U) 

= max (v(U,p)--f-(Vu(x)-a(x)X7u(x)-q-S7u(x))dx-p-q 
utA(U) \ JU 2 

= v(U,p)-n(U,q)-p-q. □ 

Notice that the proof of Lemma 3.1 gave more than its statement: namely, 
the maximizer in the definition of J(U,p,q) is precisely the difference of the 
minimizers of p(U,q) and v(U,p). 

In the rest of this section, we present some basic properties of J which are 
needed in the bootstrap argument in the next section, and we fix a bounded 
Lipschitz domain U £ throughout. We denote the unique (up to additive 
constants) maximizer in the definition of J(U,p,q) by 

u(-,U,p,q ) := maximizer of J(-,U,p,q) among functions in A(U). 

Notice that u(-,U,0,q) is the minimizer for p(U,q) and -u(-,U,p, 0) is the 
minimizer for v(U,p). 

We next record the first and second variations of the optimization problem 
implicit in the definition of J(U,p,q). Henceforth, we make use of the nota¬ 
tion (1.4). 
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Lemma 3.2. For every p, q e 18L d and v e A{U), 

(3.2) {a57u(-,U,p,q),Vv) u = (-a p + q,Wv) v 


and 

(3.3) J(U,p,q) - J(u(-,U,p,q) +v,U,p,q) = ^ (W, aVv) u . 

Proof. For convenience, for t > 0, set u t ■= u(-, U,p,q) + tv. Compute 
0 < J(uo,U,p,q) - J(u t ,U,p,q) 

- t 2 f-Vn(x) • a(x)Vv(x) j dx 

+ t (a(x)Vu 0 (x) + a (x)p - q) ■ Vv(x) dx. 

Dividing by t and sending t -> 0 gives 

-f (a(x)\7u 0 (x) + a (x)p - q) ■ S7v(x ) dx > 0. 

Ju 

Repeating the argument with -v in place of v yields (3.2). Returning to the 
previous identity and taking t = 1 gives (3.3). □ 


For reference, we observe that by taking v = u(-,U,p,q ) in (3.2) we obtain 
the identities 

(3.4) J(U,p,q ) =i(a Vu(-,U,p,q),Vu(-,U,p,q)) u 

=^{-*P + Q,Vu(-,U,p,q))u. 

It is easy to see from (3.2) that (p, q) ^ u(-, U,p, q) is a linear map from R d xK <l 
into A(U); that is, for every Pi,p 2 , Qi, Q 2 e and s,t e M, 

(3.5) u(-,U,tpi + sp 2: tq x + sq 2 ) = Ri(-, £/,pi, gi) + su(-,U,p 2 ,q 2 ) 

Likewise, J is quadratic: for every p, (/ e l d and t > 0, 

(3.6) J(U,tp,tq) =t 2 J(U,p,q). 

We next show that J(-,U,p,q ) responds quadratically to perturbations near 
its maximum. 


Lemma 3.3. For every p, q e M, d and v,w e A(U), 

(3.7) j || Vv - Vw|l 2 (u) < 2J(U,p,q) - ff(w,U,p,q) - ff(v,U,p,q) 
and 

(3.8) 2 J(v,U,p,q) -J(w,U,p,q) -J(U,p,q) < j ||Vw - VHIl 2 (I0 . 
Proof. For any v x . v 2 £ A(U), 

2 J [ Vl ^ V2 , U,p , q^j-J (ui, U,p, q)-J (v 2 , U,p , g) 

= ^ (a(Vni - Vu 2 ), Vui - 
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Then (3.7) follows by choosing v± - v and V 2 -w and using the maximality of 
J(U,p,q ), and (3.8) follows similarly by choosing v\ - w and V 2 -2v -w. □ 


Lemma 3.4. For every p, q,p', q 1 e 

(3.9) J(U,p + p',q + q') - J(U,p,q ) - J(U,p',q') = {-ap' + q',Wu(-,U,p,q)) v . 
Proof. By the first variation (3.2), 

(aVu(-, U,p, q),S7u(-, U,p + p', q + q')) v = (-a(p + p') + q + q\ Vw(-, U,p, q)) v 

= 2 J(U,p,q) + (-a p' + q',Vu(-,U,p,q)) u 

and, by (3.5), 

(aVu(-, U,p, q), Vtt(-, U,p + p',q + q')) u 
= (-ap + q,Vu(‘,U,p + p' ,q + g'))cr 
= 2J(t/,p + p',g + g') - {-ap' + q',Vu(-,U,p + p',q + q')) u 
= 2J(U,p + jf, q + q') - 2 J ( U,pq') - (-a p' + q', Vw(-, £/,P, . 

Combining the two displays above gives (3.9). □ 


We denote by V</(£/,p,g) the gradient of J(C7,•) at (p,</). By (3.6), this is 
a linear mapping from l d x R rf ^ t which, in view of the previous lemma, can 
be expressed by 

(3.10) V J(U,p,q)(p r , q r ) = (-a p' + q',Vu(-,U,p,q)) u . 

This relation between V J(U,p,q) and the spatial averages of the gradient and 
flux of u(-,U,p,q ) will play an important role in the proof of Proposition 4.1. 
For simplicity, the maps p' >-» V J(U,p, q)(p', 0) and q' VJ(C,p,g)(0,g') are 
sometimes denoted by S7 p J(U,p,q) and V q J(U,p,q), respectively. 

Lemma 3.4 gives the identity 


(3.11) \j(U,p u q r) + \j{U,p 2 ,q 2 ) - J (u, 


P1+P2 gi + <?2 


1 


= ^J(U,p 1 -p 2 ,qi-q 2 ) ■ 


This readily implies the following upper convexity estimate for J in (p, q) and 
the lower convexity estimates for J in the variables p and q separately. 


Lemma 3.5. For every p,p\,p 2 , q, qi, q 2 e 


(3- 12 ) ^J(U,pi,qi) + ^J(U,p 2 ,q 2 ) ~ J (u, ^(pi + p 2 ), ^(?i + q 2 ) j 

< A (|pi - P2I 2 + ki - g 2 | 2 ), 

(3.13) ^J(U,p u q) + ^J(U,p 2 ,q) - j(u, ^(p, + p 2 ),<?j > ^ |Pi ~P 2 | 2 
and 

(3-14) ^J(f/,p,gi) + ^J(U,p,q 2 ) - J (*7,p,^(gi + g 2 )J > 7 ^ \qi - q 2 f ■ 
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Proof. We apply (3.11) together with (3.4) to obtain an upper bound for 
J(U,pi~p 2 ,qi~q 2 ), and rearrange to obtain (3.12). We obtain (3.13) and (3.14) 
in a similar way, using the bounds 

J(U,p,Q) = v{U,p) > ^|p| 2 and J(U,Q,q) = -p(U,q) > t^M 2 - □ 

By subadditivity, stationarity and (1.9), for every □ e C, we have 

(3.15) ^p-ap<E[u(D,p)]. 

By (1.11), this implies 

(3.16) ^q- 5 _1 g< -E[p(D,g)], 
so that 

(3.17) E [J(n,p,q)] > ^p-ap+^q-arQj-p-q. 

Due to Lemma 3.5, there exists a unique, deterministic matrix Q = Q(U) 
such that, for every p e M d , 

P = V q E[J(U,0,Qp)] =E[w q J(U,0,Qp)]. 

Indeed, for each fixed p e M d , we can consider the minimum of the uniformly 
convex, quadratic function 

q»E[J(U,Q,q)]-p-q. 

This defines a linear map from p to the minimum point q(p). The matrix Q is 
defined by q = Qp. It is easy to check that Q is symmetric, and the upper and 
lower uniform convexity of J(U, 0, ■) ensures that Q is positive and in particular 
invertible. In fact, we have 

(3.18) Id < Q < A Id- 
Notice that, for every □ e C, we have 

(3.19) <3(m) < a. 

Indeed, by (3.17), 

±p-Q(n)p = ±Q(n)p-E[v q j(u,o,Q(n)p)] 

= E[J(U,0,Q(n)p)] 

>^Q(U)p-a l Q(n)p. 

Putting p := Q(D) _1 (/ gives Q(n) -1 > a -1 , which is equivalent to the claim. For 
future reference, we notice that since u(-, U,p, Qp) = u(-, U,p, 0) + u(-, U, 0, Qp) 
and -u(-,U,p, 0) is the minimizer of u(U,p), we have that -u(-,U,p,0) e I_ v + 
Hq(U) and 

f S7u(x, U,p, 0) dx - -p 

Ju 

and thus by (3.10) that 

V q E[J(U,p,Qp)] = WqE[J(JJ,0,Qp)] + W q E[J(U,p,0)] =p-p = 0. 
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Similarly, we denote by P - P{U) the deterministic, symmetric matrix de¬ 
fined via the relation 

q = VpE [J(U,P q, 0)]. 

As above, we have the estimates 

\h < P(U) < I d , 

for every □ e C, 

p(n) < a 1 

and 

(3.20) V q E[J(U,Pq,q)]=0. 


We next give an estimate for the difference between Q(U) and the homoge¬ 
nized coefficients a in terms of the expected size of J(\3,p,Q(\3)p). 

Lemma 3.6. There exists C(d, A) < oo such that, for any □ e C, 

|a- Q(m)| + \ar l - p(n)\ < Csup \p\~ 2 E[J(a,p,Q(a)p )]. 


Proof. We prove only the estimate for |a- Q(D)|. The argument for the bound 
on |a _1 - P(m)| is similar. We drop the dependence on □ and write Q = <5(Q) 
and J(p,q ) = J(\3,p,q). Set 

rj := sup \p\~ 2 E[J(p, Qp)] . 


By (3.17), we have, for every p e M d , 
0 < • ap + i(5p • a L Qp 


p-Qp<E [J{p, Qp)] < f]\p\ 2 . 


It follows by uniform convexity that, for every p e M. d , 

\Qp- ap| 2 < Crj\p\ 2 

and thus, by upper uniform convexity of q ^ E \J(p,q)] and the fact that this 
map achieves its minimum at q = Qp , we deduce 

E [J(p, ap)] < Crj\p\ 2 . 

Using (3.1), (3.15) and (3.16), we get 

p • ap < E [ J(p, 0)] + E [ J(0, ap)] < p • ap + Cp\p\ 2 . 

In view of (3.15) and (3.16), this implies that, for every p, q e M d , 


E UGhO)] - -p-ap 


< Cr)\p\ 2 and 


E[J(0,g)] 


-q ■ a 1 q 


< Crj\q\ 2 . 


From this, the definition of Q and the fact that J is quadratic, we get 

ip-Qp= iQp-E[VgJ(0,Qp)] = E [J(0, Qp)] < ^Qp ■ a^Qp + Cp\p\ 2 . 

Thus Qa^Q > Q - Cr)I d , that is, Q(a~ x - Q~QQ > -Cr)I d . By (3.18), this 
implies Q > a - Crjl d . In view of (3.19), the proof is now complete. □ 
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4. The bootstrap argument 


This section is devoted to the proof of the following estimate for J. 

Proposition 4.1. For every a e (0,^), there exist C(d, A, a) < oo such that, 
for every p e and □ e C, 

(4.1) E[J(D,p,5p)] < C'|p| 2 |D|~ a . 


An immediate consequence of Proposition 4.1 is 

Corollary 4.2. For every a e (0,1), there exist C(d,A,a ) < oo such that, for 
every p,q and □ e C, 


(4.2) 


( 1 
E[u(n,p)] - -p ■ ap 


E[//(□.9)] + TjV'* lq 


<C\p\ 2 \n\- a , and 
< C\q\ 2 !□!'“. 


Proof of Corollary f.2. By (3.1) and Proposition 4.1, 

(4.3) E[;/(□, p)] -E[/i(D,ap)] -p-&p< C|p| 2 |n| _a . 

Combining this with (3.15) and (3.16) yields the result. □ 


We begin the proof of Proposition 4.1 with some reductions. First, observe 
that it suffices to demonstrate (4.1) for cubes of the form R > 1, since 
we can then apply the result to translations of the law P (recall that we only 
assume Z d -stationarity). Second, in view of Lemma 3.6, in order to prove (4.1), 
it suffices to show that, for each a < there exists C(d, A, a) < oo such that, 
for every R > 1 and pel 11 , 

(4.4) E[J(n R ,p,Q(n R )p)] <C\p\ 2 R- da . 

The proof of (4.4) is by induction. For each a e (0,1) and K > 1, we let 
5(a, K) be the assertion that, for every R > 1 and p e M d , 

E [J{U R ,p,Q{U R )p)]<K\p\ 2 R-^. 

In view of Lemma 3.1, the base case of our bootstrap was proved in [4], 

Proposition 4.3 ([4, Theorem 3.1]). There exists ao(d,A) > 0 and Ko(d,A) < 
oo such that 

(4.5) S(a 0 ,K 0 ) holds. 

The previous proposition was proved in [4, Section 3] by showing that min- 
imizers of p{U n ,q) for large n are expected to be flat- that is, close to a 
deterministic affine function- and this allows comparison to v(\3 n ,p) for an 
appropriate p (the slope of the affine function). Note that the result was 
proved with a in place of Q(Llft), but since Q(C_r) is the minimum of the map 
q ^ E [ J(\J R ,p, (/)], we can make this replacement. 

In view of Proposition 4.3, it therefore suffices to show that there exist 
e(d, A) > 0 and C(d,A) < oo such that, for every a e [a! 0 , -) and K > 1, 

S(a,K) ^=> S + e(l - da),CK^ . 


(4.6) 
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Indeed, an iteration of (4.6), starting from 5(ao,/fo), yields (4.4). 

The rest of this section is focused on the proof of (4.6). Throughout, we fix 
a e [cr), 2 ) and K > 1, and assume that S(a, K ) holds. We fixfi>l and p e W d . 
By homogeneity, we may assume that \p\ < 1. We set 

(4.7) q:=Q(n R )p, 

where we recall that Q(Dr) is dehned before Lemma 3.6. Throughout, 

v:=u(-,n R ,p,q) 

denotes the maximizer of J(0 R ,p,q). Notice that Lemma 3.3 gives 

( 4 - 8 ) \Wv\\l^ DR) <U(n R ,p,q). 

In particular, the induction hypothesis yields 
(4.9) E [IMli> (Da) ] <4KR->«. 


4.1. Rough sketch of the argument for (4.6). Before giving the complete 
details, we present an informal summary of the proof of (4.6). Choose a meso¬ 
scopic scale r e (1, R) and partition the cube into smaller cubes of the form 
□. r (y), y € D/j. Observe that, using (3.2) in the last step, 

Y,J( V > n r(y),P,Q) 

y 

E < _a p + q>w) ar{y) 

y 

E(a V%,Vv) Dr(l/) , 

y 

where u y u(\J r (y),p,q). If the induction hypothesis S(a,K ) holds, then, 
using also (4.9) and the Lipschitz estimate, we have 

(4.10) E[|V%l| a(nr(>)) ]sCr-' fa and E[|| W||| a(Dr(>)) ] < CBT^. 

No improvement on the estimate of E[ J(\J R ,p, g)] can be obtained from these 
observations alone, as we have not yet used the mixing properties of the coef¬ 
ficients. 

If v is sufficiently close (in say L 2 ) to an affine function of slope p y in the 
mesoscopic cube □ r (y), then, since u y is a solution of (1.1), we may integrate 
by parts to get 



(aVu y ,Vu) Dr(j/) = {a\7uy,py) ar{y) + a small error. 
Recall that the dual vector q was chosen in (4.7) so that 


E 




= 0 , 


Hence, the sum 

( 4 - U ) (7) E E [< aV ffin^) nr G,)] 


that is, E \p y ] = 0. 

y 


/ P\~ d 

r 

(?) 

p y - -1- a Vu y (x) dx 

- ' JOyir) 






MESOSCOPIC REGULARITY IN ELLIPTIC HOMOGENIZATION 


21 


can be seen as a measure of the correlations between the (discrete) random 


fields (/□„(!,) aV “y) and (Pv)- 


The main insight is that the higher regularity theory guarantees that the 
slopes p y exist, that the approximation to v has a sufficiently small error, and 
that p y changes very slowly in y. Meanwhile, the spatial averages of the fluxes 
(/□r (y) are essentially independent as y varies. If we denote by s a 

mesoscale with r « s « R on which the vectors p y are approximately constant, 
then we can expect to bound (4.11) by 

/ o \ - rf / 2 

(4.12) R -da/2 r -da/2 l\ 

Indeed, by (4.10), R~ da / 2 is the expected size of each p y and r~ da / 2 is the ex¬ 
pected size of the spatially averaged flux in a mesocube. The factor of ( s/r)~ d l 2 
comes from the CLT scaling, as ( s/r) d is the number of smaller mesocubes of 
size r in each larger mesocube of size s. 

The expression in (4.12) can be made smaller than R~ da by choosing the 
mesoscales r and s appropriately, provided that a < 1. This suggests that 
the correct estimate for E [«/(□#,£>, g)] should be (up to possible logarithmic 
factors) R~ d . The argument we present below does not perform so well, and 
saturates at R due to a boundary layer we have neglected in this sketch. 

4.2. Declaration of parameters and mesoscales. We now proceed with 
the rigorous argument for (4.6). We take mesoscopic scales r,s,l> 1 such that 


(4.13) 

and 

(4.14) 


R l s 
—, —, - 6 N 
l 3s r 


r = C'i? 1_ ( m+2 ) £ 


l = CR 


1-e 


s = CR l ~ 2£ , 

for fixed parameters e e (0, and m > max{d, 2} to be selected below. Here 
the constants C in (4.14) are very close to 1 and are chosen so that the con¬ 
straints (4.13) are satisfied. 

The largest mesoscale l denotes the thickness of a boundary layer we remove 
from IH /2 in the first part of the argument; it also serves as a reference scale for 
the regularity estimates we will apply at the smaller scale s, where we apply 
Theorem 2.1 to obtain polynomial approximations to v. Finally, r denotes the 
size of the smallest mesoscale cells in which we compare v to local solutions 
(denoted by v z below) chosen to match the polynomial approximations made 
on the larger scale. Each of these mesoscales, even the smallest, will be chosen 
to be very close to the macroscale R. 

The degree of the polynomials used in the mesoscopic approximation will be 
fceh, also to be chosen. We let the exponent 5 - 5{d, A) > 0 be the minimum of 
the exponent given in the statement of Theorem 2.1 for the choice s = 1 and the 
one given in the statement of Proposition 2.2 for the choices s = 1, U = 
and e = 1. We may assume <5 < 1. We let X denote the maximum of the random 
variables X and 1Z from Theorem 2.1 and Proposition 2.2, respectively, with 
the same choices of parameters. 
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In order to apply Theorem 2.1 from scale l to scale s, we must have that 
(4.15) s>lk+%. 


For this, it suffices to impose the restriction 


(4.16) 


<5 

£ ~ 2{k + iy 


In the course of the proof, we will also find it convenient to fix the parameters 
m, k, £ as follows: 


(4.17) 
and 

(4.18) 


m 


2(1 +a) 2d(l + a) 
1 - a 2- da 


v d 


k:= 


10d(l + Q)m 
6 


and e 


a 


5 


59 


A 


A 


2(m + 3) 2{k + 1) 20d(l + 9)m 


Here the positive parameter 9{d, A) is the exponent related to the local and 
global Meyers’ estimates, which in particular give us that, for every ijj e VF 1 ’ 2 ( 1+el )(n r (y)), 
w e A(n 2 r (y)) and w e + H^(D r (y))) n A(O r (y)), 


( 4 - 19 ) |Vw||i2(i +e)(nr(2/)) < C\\Vw\\ LHa2r(y)) 

and 

( 4 -20) II y' w \\L 2 ( 1+e '>(n r (y)) ~ C I V^lL 2( i +s )(D r (y)) ' 

Notice that, since a e [a 0 , 2 ), we have e > c(d, A) > 0. In fact, each of k, m, and 
£ are bounded above and below by constants depending only on (d, A). This 
implies that each of the constants C and c in the estimates below will depend 
only on (d, A) instead of (d,A,k,m,£). 

The improvement in the exponent a will be the result of a CLT scaling 
arising due to the fact that the u 2 ’s inside each s sized cell are independent 
of each other. This is seen in the proof in Step 3, below. In the final step 
of the proof, we will define each of the mesoscales and other parameters in 
such a way that the error terms encountered in the argument will be at most 
C K R-du-etl-da) 


4.3. Removal of boundary layer. The improvement in the exponent a is 
based on the application of regularity estimates to v. Since we do not have a 
boundary condition for v, these estimates are inapplicable near the boundary 
of the macroscopic cube □#. Therefore we must remove a boundary layer, 
which is accomplished very simply by subadditivity. It is the error we make 
here that forces our bootstrap argument to halt before da - 1. 

As R/l is an integer, the cubes {Oi(y)} y ^ lZ d nDR form a partition of (up 
to a set of Lebesgue measure zero). We let denote the cube obtained from 
□k after removing a mesoscopic boundary layer of thickness /: 

: = Or ' U : y e ^ da i n * 0} • 

Notice that 

I 












MESOSCOPIC REGULARITY IN ELLIPTIC HOMOGENIZATION 


23 


Then by subadditivity, stationarity, Lemma 3.6 and the induction hypothesis, 
we obtain 


(4.21) 


E[J(n*,p, 4)1 


< ^E[J(a^,p,g)] + ^E [./(□* X n° R ,p,q)] 


!□* 

!□ 


R 


!□ 


R 


Pr\ 

E[J(D” R ,p, 9 )] + CA-(y) 


l-da 


In the second inequality in the display above, we used the fact that \ is 
a union of cubes of the form y + □*; the induction hypothesis and Lemma 3.6 
imply that 


(4.22) E [\Q(n R ) - Q(y + □,)!] < CI<l ~ da , 


so that another use of the induction hypothesis gives the desired estimate. 
Throughout the rest of the argument we denote 

Z := rZ d n n° R . 


4.4. Local mesoscopic approximations of v. The next step in the argu¬ 
ment is to introduce local approximations of v in each mesoscopic cube □ 3r (?/), 
with y e Z. These are also solutions of (1.1), that is, members of „4(U 3r (?/)). 
The advantage will be that the approximations, which we denote by v y , serve 
to localize the dependence on the environment and allow us to exploit the 
independence assumption. 

We first introduce polynomial approximations of v in the larger mesoscopic 
cubes of the form □ 3s (^), z e s7L d n With k e N to be defined below, we 
select w z e Ak such that 

( 4 - 23 ) \\v-w z \\LHn 3s (z)) = in 4 • 

weA-k 

That is, w z is the best approximation to v in L 2 (D 3s (^)) among a-harmonic 
polynomials of degree at most k. 

We next present some basic estimates concerning the expected size of w z 
and the quality of the approximation to u. This is where we use the higher 
regularity estimates in Theorem 2.1. 

Lemma 4.4. Assume that (4.16) holds. Then there exists C(d,A,k ) < oo such 
that, for each z e slA n 

/ c\ 2fc+2 

(4.24) e[|«-» 4 J(Ds . W) ] E[||VB|| 1(DlW) ] + Cjr M - 1 , 

(4.25) ^[I^' u lIl~(d 3s (*))] - C*E[|Vu||^ 2 ( Di ( z ))] + CR 2d 1 , 
and, for every m e {2, ..., k}, 

(4.26) E[s 2 ( m_2 ) |V m w z ||^oo (n3s( ^ )) ] < C , / _2 E[||Vu||^ 2 (i - I;(z)) ] + Cs~ A R~ 2d ~ 1 . 
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Proof. We break the argument into three steps. 

Step 1. The proof of (4.25). Since w z is an a-harmonic polynomial, we have 


(4.27) 


Vw z 


L°°(n 33 (z)) 


C I, 

< — \\w z 

s " 

C I, 

< — n - 

s 11 


< C [I Vu 


( w )n 3s (z) Hl 2 ( d 3s( z )) 

( i ’)n 3s (z)|^2 (n3s( , )) 
L 2 (D 3s (z)) j 


where, in the above display, we used Lemma 2.6 in the first line, the optimality 
of w z in (4.23) tested against the constant function (n) D3s ( 2 ) and the triangle 
inequality to get the second line, and finally the Poincare inequality to get the 
last line. Squaring and taking expectations, we get 


E [| VlU Z || i oo( n3s ( 2 ))] 

< CiqiVi'llt., 1(t,x<c,}] 1 C'E[||Vy||L2 (D3s(z ) ) l {T.X>Cs}] 
<CTE[||V«|||2 (niW) ] + CE[||V«|| L 2 (D 3s (z)) 1 {T z X>Cs}\ • 


To bound the second term on the right side, we use (4.8) and the (deterministic) 
estimate J(\J R ,p,q) < C to very crudely bound 

|Vi>l7e„ w) < C S ~ d R d < CR d 


and then combine this with strong integrability of X , which give the following 
estimate: 


(4.28) E[l {Tz * >s} ||Vu||^ 2 (D3s(2)) ] 

< CR d+2 P [X > s] < CR d+2 exp (-cs) < CRT 2d ~ l . 

In the last line we used s > Rh from (4.14). This completes the proof of (4.25). 

Step 2. The proof of (4.24). According to Theorem 2.1 (recall that the assump¬ 
tion (4.16) implies (4.15)), we have 


/ $ \ 

\\ V ~ W z\\ L 2 (D 3s (z)) 1 {T z X<Cs} < C ^ ~ j / II Vv||x,a (Dl(z)) 1{ T z X<Cs}- 

Recall that { T y } yeR d is the translation group acting on the probability space hi. 
In the event that T Z X is too large compared to s, we proceed differently. Ap¬ 
plying (4.23) and the Poincare inequality, we get 

I v “ ^li 2 (D 3s (A) < \\v ~ («)□,(*)IL 2(D3s(2)) T z X>Cs} 



^ - (^)nz(^) II l 2 ( DjW ) 


VU IU a (Dl(*)) ^~{T z X>Cs} ■ 
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Combining the previous two displays yields 


W v W A\L 2 (a 3s (z)) 

, k +1 


-(i) 


l \ 2 


1 l|Vw||L 2 ( Di ( 2 )) 1{ T z X<Cs } + Cl 
Squaring and taking expectations, we obtain 

. 2fc+2 


I Vu Il 2 (d i (*)) 1 { T z X>Cs}- 


( v ZAC+Z 

j) ; 2 e[iiv«.| 




Cl 2 {~s) E [^--> H v ^i 2 ( n iC))] • 


To estimate the second term on the right side, we proceed in the same way as 
the end of Step 1, above, to get 

d 


t- 


(£)>, 


ll 2 

ll h 2 (n,C)) 


]<CR 


2d-l 


1 {T [y] x>s} ll Vn| 

This completes the proof of (4.24). 

Step 3. The proof of (4.26). We take i z € Ai to be the best affine approximation 
to v: 

lk-4L 2 (D3sW = inf \\v-£\\ L 2 {a3a{z) . 

Using the triangle inequality and (4.24) twice, once with k as above and once 
with k - 1, we get 

. 4 


E Hue - 


Clf( D ,.M)b« 2 ()) E[| 




+ CR 


-2d-l 


Since w z - £ z is an a-harmonic polynomial, this yields, by Lemma 2.6, 


E 


mm s 2(m-2) ||w rn 7 ,, || 2 

SUp S || V w z\\L°°(a 3s (z)) 

me{2,...,fc} 


CE[.> 


—4 


w,-£ 


UlL°°(n 3s ( 2 )) 


< Cr 2 E [II Vw|||j (D i(z)) ] + Cs-'R 


—2d—1 


□ 


which is (4.26). 

We next introduce the local mesoscopic approximations to v, which are based 
on w z . For each y € Z, we denote by [y] the unique element of slA n such 
that y e □ s ([y]). For each y e Z, we let v y denote the solution of the Dirichlet 
problem in □ r (y) with boundary condition W[ y y That is, v y is the unique 
element of *4.(Ll r (y)) n [w^ + iL(((n r (y))). Note that, for every (j) e ^4(D r (y)), 


(4.30) 


(a (VVy - V%]) , V0) 


D r {y) 


= 0 . 


We next give the estimate for the expected difference between the gradients 
of v and v y , using the previous lemma. 

Lemma 4.5. Assume that (4.16) and (4.18) hold. Then there exists a constant 
C(d,A,k) < oo such that 


(4.31) I^EEp Vv-Vi>j| a(Dr(v)) 

ytZ 


< CKR 


-dot—Adme 
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Proof. Set rj y v - v y and let 
(4.32) t := r - r 1-£ , 


6dme(l + 6) 


where 6(d, A) is as in Meyers’ estimates (4.19) and (4.20). Observe that the 
choice of e in (4.18) guarantees that e <\. We first split each of the summands 
into two pieces as follows: 

(4.33) f \Vq y (x)\ 2 dx = f \S7rj y (x)\ 2 dx + f | V^(x)| 2 dx. 

Ja r (y) Ja t (y) Jy+a r \a t 


The first term we will estimate with the aid of the Caccioppoli estimate and 
Lemma 4.4, and the second one using the Meyers’ estimates. Indeed, we have 
by the Caccioppoli estimate that 


— f 

|LU Jo 


Wvy(x) | dx < — 
□t(0 {r 


C 


f . . \%(x)f 

JOr(y) 


t ) 2 Ja r ( y ) 


dx. 


To estimate the term on the right, we have by the triangle inequality that 



dx < 2 p - W [ y ] | 2 


(□r(2/)) 


+ 2 II Vy-W\ y ] 


2 

L 2 (a r (y)) ' 


Now, for the first term we have 


E 

zesZ d nD° 


\\ v w z\\L 2 (n s {z)) 


It w z\\ l 2 (d s ( z )) i 


and thus we obtain by (4.24) that 


i^r 1 E 


ytZ 


W M\\L 2 (a r (y)) 


sc (5)‘-^( p (Tr E[iv,,i ^‘->> ]+c7jr * 1 ) 

On the other hand, using Proposition 2.2 and (4.25) we get 

E SCr^Eflvt^rilL^))] 

<Cr 2 - M (E[||V t ,||| 1(D , w) ] + Cfl- M - 1 ) 


Summing this over Z implies 
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Connecting above estimates and using the fact, from (4.18), we have 5 >e, we 
arrive at 


(4,34) Ur'Z E ff Av^xfdx 

L-'Dt(y) 

(pnr 




~i —s 


r-t, 




By the choice of k and t in (4.18) and (4.32), respectively, we have 

|_L_ j ^(1 j - R 2 (me+e-ke) _ ^-2e(fc- 6dm < 1+9) -m) < j^-idme _ 

and by the condition for e in (4.18), 


~l—5 


r-t 


= R 


-2(l-e(m+2)) (<5- 


6cLm(l+9)e 


) 


0 - ) < 


Therefore the induction hypothesis yields by way of (4.9) that 
(4.35) \Z\~ X Y, E [ -f |V^(t)| 2 dx 1 < CKR~ da ~ 4dm£ . 

yeZ Uo t (y) 


To treat the second term in (4.33), Meyers’ estimates (4.19) and (4.20) pro¬ 
vide us, via Holder’s inequality, 


f |V??(x)| 2 dx< 

11—1 y* I J N Hilt 


Pr x □*[ 

Pr| 


( -f \\/i](x)\ 2+2e dx] 
\Jn r (y) ) 


6 

C {~~) + f a3 (y) (|V^(PI + |Vw [y ](x)|) 2 dx. 


Following Step 1 in the proof of Lemma 4.4 and applying (4.25) once more we 
obtain 


E 


e 

X-n l V7 *( X ) 2 (a l (y))] + CR ^ d ~ 1 ) 


p r | ~/y+n T .\Di 

By the choice of t in (4.32) and £ in (4.18) we have 


(P)“” 


_ j^-(l-e(m+2))^g < R-idme . 


Therefore the summation yields, as before, 

1 


f |V?](x)| 2 


dx 


< CKR 


'-da-Adme 


(4,36) |zr‘£E 

ytZ |_ I' —I r | 

Combining (4.33) with (4.35) and (4.36) yields the statement of the lemma. □ 


Next we show, using the previous lemma and the uniform convexity of J in 
p and q separately, that the expected difference between J(v,D r (y),p,q) and 
J(v y ,n r (y),p,q) is small, after averaging over all y € Z. 
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Lemma 4.6. Assume (4.16) and (4.18). There exists C(d,A,k) < oo such that 
(4.37) ^ E E [ \J (u V r (y),P, q)-J (v y , D r (y),p, q)\ ] < CKR d . 


ytZ 


Proof. For convenience, let u y := u(-,\3 r (y),p,q) denote the minimizer in the 
definition of J(\3 r (y),p,q). Using (3.3) twice and summing, we find that 

(4.38) J (v,n r (y),p,q) - J (v v ,n r (y),p,q) 


= g ( Vu " a ( 2Vu v ^ v y))n r (y) 


Thus 


(4.39) \J (u, U r (y),p, q) - J (v y , D r (y),p, q)\ 

<C f |Vn(x) - X7v y (x)\ (\S7u y (x)\ + |Vn(x)| + |Vu y (a;)|) dx. 
Ja r (y) 

We will estimate the term on the right. By (4.22) (with l replaced by r), 


E [ || Vu. 


y Hi 2 (□.(?/)) 


< CE[J(D r (y),p,q)] 
<CE[J(\3 r (y),p,Q(\3 r )p)] + CKr- da 

< CKr~ da . 

On the other hand, by the induction hypothesis and (4.25), we have 


I-2I' 1 E E [ 

<i>Z 


I II2 

\^ v y\\L 2 (a r (y)) 


< cizy 1 E e T | vit» [s/] || 

V*z 

< CKR~ da . 


2 

L. 2 (a r ( y )) J 


Lemma 4.5 further gives 

i^r 1 21 E [ 


ytZ 


J y\\L 2 (a r (y)) 


< CKR 


-da 


Combining above three displays and using the triangle inequality yield 


1^1 E E [I v u v II L 2 (a r ( y )) + II v v y li 2 0=irO)) + H V v H L 2 (n r (y)) 


< CKr 


-da 


We now use Holder’s inequality, Lemma 4.5 and the display above to get 
l^r 1 E E , l Vt, ( x ) - V U/( X )I (|Vu y (a:)| + |Vu(x)| + |Vn y (a;)|) dx 

.~>Or(y) 


ytZ 

<c (e 


|Z| 1 E | Vn - Vw 

ytZ 

p ' jp - do j 2 —2dme - do j 2 


2 

y\\L 2 (a r ( y )) 


CKi r ~ da/2 


Now we return to (4.39) and use the previous estimate to obtain 
jZr 1 Y, E [ I J (v, n r (y),p, q)-J (v y , n r (y),p , q) | ] < CKr~ da ! 2 Rr da/2 R; 


2 dme 


ytZ 


By the choice of r in (4.14), we have r~ da l 2 = CR~ da / 2 R da ( m+2 ) E I 2 , and therefore 
we get the desired result using a < 1 and m > max{rf, 2}. □ 
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4.5. Improving the exponent using independence. As in the proof of 
Lemma 4.6, we set u y := u(-,\3 r (y),p,q). Applying (3.2) and (4.30), we find 
that, for every y e Z, 

(4.40) J (v y ,n r (y),p,q) < {-ap + q, Vv y ) Dr(l/) 

- (aVu„,VUi,) nr(y) = (aVu y ,Vw [y] ) aAy) . 

This together with (4.21) and Lemma 4.6 imply that 


(4.41) E[J(n K ,p.,)]<c(^) E 


E ( a V u »>V%]) 

.ytz 


+ CK 


(?) 


□r(y) 

-1 


l~ da + CKR- d ( a+£ \ 


The sum on the right side of (4.41) allows us to see a CLT-type scaling because 
the terms are essentially independent for each y inside a single larger mesoscopic 
cube Qs^). This is the mechanism which improves the exponent a. The 
precise statement we need is formalized in the following lemma. We remark 
that this is the only point in the proof that we use the choice q = Q(\Ar)p. 


Lemma 4.7. Assume (4.16) and (4.18) hold. There exists C(d,A,k) < oo such 
that, for every z e s7L d n 


(4.42) E 


sup 

wCP k 


|V«'lli-( D . W ) + i||v 2 «)|| i . (n>w)i 


,-2 


2 4 1 


E (aVu,,, Vw> nr(y) 

^yer1j d r\Ds(z) 



n \-d / ; 

r n-2 \ 

(;) H( 

-i | o 

+ 

-J + Kr~ do I 


Proof. We fix z e sZ d n throughout the proof. 

We first prove a bound for each particular w e V k , and then put the supre- 
mum over w inside the expectation, using linearity and that Vk is a finite 
dimensional vector space. We begin with an estimate of the expectation of 
each term in the sum. 

Step 1. We show that, for every w € V k and y € r7L d n □ s (^), 


(4.43) E[(aVu !/ ,V«;) nr(?/) _ 

< CKr -*■ IvHL-tH.w) 4 CKir'-^ || V^|| t . (D , w) . 

We first observe that 

(a VUy,Vw) nr{y) -Vw(y)- f a {x)sju y {x)dx 

<°C sup |V<Rx) - Vm(y)| f M«-)l -• 
xtn r (y) Ja r (x) 
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Thus, by (4.22) and the induction hypothesis, 


(4.44) 


E 


(aVu yi Vic) Dr(y) - Vw(y) ■ + a (x)Vu y (x)dx 

J \3 r (y) 

1 

f , Wu y (x)\ 2 dx 
LoD r (a;) 


< Cr V 2 um 


L~(Dr(y)) 


T - 1 i _da_ M o .. 

< CR zr 2 || v w\\ L oo (ar(y)) . 

On the other hand, using Lemma 3.6 and the induction hypothesis again, we 
get, for every □ e C, 

|a _1 - _P(m)| < CK |D| a , 
from which we deduce, using also (4.22), that 

\p-P(n r (y))q\ <CKr~ da . 

Therefore 


E 


J a (x)Vu y (x)dx 
o r (y) 


= |E[v p J (□ r (y),p,g)]| 

< |E [S7pJ (O r (y),P(n r (y))q, g)]| + CKr 
= CKr-** , 


-da 


as E [V P J (n r (y), P(n r (y))q, q)] = 0 by (3.20), and hence 


(4.45) 


E 


S7w(y)- j- a(x)S/u y (x) dx 
Jn r (y) 


< CK\S7w(y)\r 


-da 


We now obtain (4.43) by combining (4.44) and (4.45) and the triangle inequal¬ 
ity. 

Step 2. We show using independence that, for every w e P k , 


var 


E ( a ^ u v^ w )a r ( y ) 

yer1j d nOs(z) 


(4.46) 

We expand the variance by writing 


< CI\r~ da 




Vie 


L°°(U s (z)) ■ 


var Y, ( a V%,V^) Dr( p) 

y^r7j d nD s (z) 

E cov[(aVUp,Vu;) nr(3/) , (aS7u y ,^w) a , yl) . 

p,j/'erZ d nn s (z) 

Using independence, the fact that (a Vu y ,S7w) Dr ^ is T (□,.(y))-measurable 
and each cube n r+ i (y) has nonempty intersection with at most C cubes of the 
form □ r+1 (y') with y' 6 rZ d n □ s (z), we obtain 


E cov [ < aVi O’ ^ w )a r (y) ■■ ( aV V» Vru) 


y,y'erZj d nn s (z) 


□r(y') 


< c y var 

yerZi d nn s (z) 


(a VUy,Vw) nr{y) 
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Finally, we observe that the induction hypothesis gives, for each y e rZ d nD s (z), 


var 


(a Vu y ,Vw) Ur{y) 


< CTe[||Vu 

<CKr~ da II 


|| 2 

y II l 2 (cM?/)) 

II2 


VlC 


\7w\ 


L°°(a s (z))- 


2 

£“(□„(*)) 


This completes the proof of (4.46). 

Step 3. We complete the proof. Observe that combining the results of the first 
two steps gives, for every w eVk, 


(4.47) 


E ( aVM ^vw) Dr(y) 

yerZ d nO s (z) 


/ , 

s\ 2d // 

Q \~ d /i 

1 \ -2 \ 


_) ( 

”) + (" 
r) \i 

-j +/\r- da ) 


x(||v^II^ (D s( , )) + /||v 2 u;|| loo(Ds(z)) ) 2 . 


To conclude, it remains to smuggle the supremum over w e Vk inside the expec¬ 
tation. As Vk is a finite dimensional vector space with dimension depending 
only on (k,d), there exists an integer N(k,d ) e N and wi,...,wn e Vk such 
that {u>i,... ,wn} is a basis for Vk, each wj satisfies 

|V^j| L oo( Ds ( z )) + l | V Wj | L oo( Ds ( z )) = 1) 


and for any w e Vk expressed in the form 


W = C\W\ + ••• + CnWn, 


we have, for some C(k,d ) > 1, 


|ci| 


M<c(| 


Vw 


L“(D s (z)) 


+ 1 


|v 2 w| 


£“(□* 0 )) 


)■ 


It follows that 


sup (||Vio|| I . (n , w) + (||v 2 ro || i „ (D-W) ) 1 


wtVk 


E ( aVM </> Vw )o r(!/) 

yerZ d r\D s (z) 


N 
3 = 1 


E ( a V«»,Vi"j) Dr(y) 

yerZ d nD s (2:) 
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Squaring and taking the expectation of the previous inequality, using that 
N < C and then applying (4.47), we get 


E 


sup(||Vn;|| Loo(Ds(z)) + /Iv 2 n;| Loo 

wzVk 


d 


E <aV%,V«;> nr(y) 

ytr'Z d nn a (z) 


<CE 

(e 

E ( aVu ^v^) Dr(?/) 

)1 


Lv- 1 

j/erZ d nn s (z) 

/ J 


N 


<cyE 


3 = 1 


2-i 


E (aVrty, Vu’j) Dr(?/) 

\y^rZ d nDs(z) 

, -2 

+ Kr~ da 




This completes the proof. 

Combining (4.40) and Lemma 4.7, we obtain 

. d 


□ 


E E [^ ( v v ,n r (y),p,q )] <CK* (-) 

y*Z ' r ’ 

E B[( 


_ da 
T 2 


(?) ’ + (r) 


+ K 2 r 2 


VwJl°o( D s( >)) + ^ l v V|| L .( Ds (,)) / 

By Lemma 4.4 and (4.9), the sum in the line above is bounded by 


zes Z d na° R 


C 






so that 


da _ da 

R 2 r 2 


E E (%> Q r-(2/),P, ?)] < CK (—) 

veZ V r ' 

Combining this with (4.41) and using K > 1, we obtain 

E[J (D^p,?)] 


(?) + (;) 


+ K^r~ d f I. 


< CK 2 I i? 2 r 2 


r-^r 


+ R C r da + 


(ff 


_j_ j^~d(a+e) I 


Using the dehnitions of the mesoscales, 

f _d 

2 =CR~ da R~ d£ ( f(i- a )- a ) 


da _ do: 

it 2 r 2 


da _ da 

R 2 r 2 


(;f 


_ R—dt* J^-£(m+l-da(m+2) /2) 


R~^T rf-dt* — C R-dc* J%de(m+ 2)-4^ 

D \ -1 

L * l~^ a — CR~^ a R £ ^ a ~ 1) 


(?) 
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It is easy to check that that the choices made in (4.16), (4.17) and (4.18) 
guarantee that the first three terms above are bounded by R~ d ^ a+e \ 

In view of these choices of the parameters, it is clear that the third term (the 
error in removing the boundary layer) is the limiting one, and we deduce that 

K[J(n R ,p,q)] < CK(FT da R £ ( da -^ + R~ d ( a+ A) < CKR~ da RR da ~A . 

Thus <S(a! + e(l - da)/d,CK) holds, for £ > c(d, A) and C(d,A) < oo which, 
after a redefinition of e, completes the proof of the claim (4.6) and therefore 
the proof of Proposition 4.1. 

5. Improvement of stochastic integrability by subadditivity 

The aim of this section is to complete the proof of Theorem 1.1, by strength¬ 
ening the stochastic integrability of Proposition 4.1 (or Corollary 4.2) from L 1 
to exponential moments. The rough argument is as follows. We decompose 
□/j into subcubes (□,.(?/)). By subadditivity, we can bound the upper fluctua¬ 
tions of v(U R ,p) by the upper fluctuations of the average over y of u(\3 r (y),p), 
up to an error controlled by the difference E[z/(U r ,p)] - E[V(Lffi,p)], which is 
small by Corollary 4.2. By independence, the average over y of v(n r (y),p) 
is unlikely to be large. The same argument applied to -p gives a control of 
the lower fluctuations of p. By duality, we can then control upper and lower 
fluctuations of both quantities. 

We make this idea precise in the following general statement. (Recall our 
slightly non-standard definition of subadditivity in (1.8), and that C is the set 
of cubes □ such that |U| > 1.) 

Theorem 5.1. Let 5 < 1, R® < oo, and letp(-) < z7(-) be respectively super- and 
subadditive quantities, such that for every R> R 0 and x e 

p(0 R (x)) andv(n R (x)) are tF(\J R+R s(x))-measurable. 

Let a > 0 and f3 e (0, a) n (0,1/2]. Assume that there exists c < oo such that for 
every cube □ e C, 

(5-1) E[i7(D)]-E[jr(D)]<j^. 

Assume furthermore that there exists c' < oo such that for every cube □ satis¬ 
fying 1 < !□! < 2 rf and every A el, 

(5.2) E[exp(A/7(m))] v E[exp(Az7(D))] < c'(l + A 2 ). 

Then there exists C = C(d, 6, R 0 , a, /3, c, d) < oo such that for every cube □ eC 
and every AeR, 

(5.3) logE[exp(A|D| /3 (P(D) - E[P(D)]))] < CA 2 , 
and the same estimate holds with v replaced by p. 

Remark 5.2. The conclusion of Theorem 5.1 can be strengthened in several 
directions. First, one can replace A 2 by A 2 a A 1 -^ in the right side of (5.3). 
Moreover, we believe that (5.3) holds with a = (3 for a <\ (and with a logarith¬ 
mic correction when a - |), and in fact, the proof given below can be adapted 
to show this stronger result when a is sufficiently small. On the other hand, 
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even if /i and v were additive, one could not improve the conclusion to /? > 
by the central limit theorem. 

The proof of Theorem 5.1 makes use of the following lemma. 

Lemma 5.3 (parabolicity of log-Laplace). There exists Ao e (0,1] such that the 
following holds. Let a > 0 and X be a random variable such that E[exp (|X|)] < 
oo and E[X] = 0. If the inequality 

(5.4) logE[exp (AX)] < a\ 2 

holds for every A such that |A| e [A 0 ,1], then it holds for every A e [-1,1], 

Proof. Since E[exp(|X|)] < oo, the function T := A h-> logE[exp(AX)] is inhn- 
itely differentiable on (-1,1). Its value and first derivative at 0 vanish, while 
its second derivative at |A| < 1 is bounded by E[X 2 exp (AX)]. We choose A 0 > 0 
sufficiently small that for every A e [-Ao,Ao], 

Viel, x 2 exp (Xx) < exp(x) + exp(-x). 

(By symmetry and monotonicity, it suffices to check the inequality for A = Ao-) 
In particular, for every A e [-A 0 , A 0 ], we have 

T"(A) < E[exp(X)] + E[exp(-X)] < 2a. 

The result then follows by integration. □ 

Proof of Theorem 5.1. For any positive integer m and C > 0, we denote by 
Al m (C) the assertion that for every cube □ satisfying 1 < |U| < 3 dm and every 
A e R, 

(5.5) logE[exp(A|D| /3 (!/(□) - E[z7(n)]))] < CA 2 , 
and that the same estimate holds with v replaced by Jl. 

Step 1. We show that for any given m 0 , there exists C < oo such that A mo (C) 
holds. Every cube □ satisfying 1 < |m| < 3 dm ° can be decomposed into a finite 
number of subcubes of side length between 1 and 2. By subadditivity, the 
hypothesis and Holder’s inequality, 

(5.6) logE[exp(A(z/(D) - E[z7(Ll)]))] < 1 + A 2 , 

uniformly over cubes □ satisfying 1 < |m| < 3 dm ° and A e R. The conclusion for 
v then follows by Lemma 5.3, and the reasoning for Jl is identical. 

Step 2. We show that there exists e > 0 such that for every m sufficiently large 
and C > 1, 

An-l(C) => An((l + 3 -£m )C). 

Since Ilm(l + 3~ em ) < oo, this will complete the proof of the proposition. 

Assuming Al m _i(C), we give ourselves a cube □ of side length R such that 
3 m_1 < R < 3 m . For notational convenience, we assume that □ = (0, R) d . We 
define a partition of □ into 3 d subcubes of side length L, each subcube being 
surrounded by a layer of smaller cubes of side length I - 3 7m for some ye (5, 1). 
(We choose a triadic decomposition of □ for coherence with the rest of the 
paper, but a dyadic one would be fine too.) In order for the partition to be 
well-defined, we ask that L - (R - 21)/3 be an integer multiple of l. This 
requirement can easily be taken care of since i « R , so we will neglect it for 
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clarity. We let (zj)i<j< 3 d be the centers of the 3 d subcubes of side length L such 
that 

3 d 

((0, L) u (£ + L, £ + 2 L) u (2 1 + 2 L, R)) d = |J a L ( Zi ), 


i= 1 


and (z'j)i<j<N be the centers of the disjoint subcubes of side length £ such that 

( 3 d \ N 

(J I = U up to a set of null measure, 

*=i / i =i 

where N = ( R d - (3 L) d )/£ d . By subadditivity, 


v 


(□) s +1 ^(a<(^))- 


i= 1 


Let r, s e (l,oo) be such that l/r+ 1/s = 1. By Holder’s inequality, for every 
A > 0, 


logE [exp (\\nf (//(□) - E[z/(U)]))] 


(5.7) 


(5.8) 


< - log E 
r 


exp [ rA|n|^ YJPinUzi)) - E[z7(D)]) 


+ - log E 

s 


□/ 


N 


exp ( sA|n|^^ £(!/(□*(**)) - E[i/(a)]) 


We decompose the term in (5.7) into 


(5.9) -logE 
r 


I,_, | o d 

□ r 3 


exp | rA ^f^ - E K n T(-i))]) 


Id- 1 3 


- £[?(□)]), 


and likewise, the term in (5.8) into 


(5.10) -logE 
s 


□/ 


N 


exp [ E(^(n^(^)) - E[K D K^))]) 


+ E(Ep(D,(j'))] - E[J?(n)]). 


It follows from sub-/superadditivity and (5.1) that 

E[i7(n)] > Jim E[/A(U)] = Jim E[/I(D)] > Ef/Jjlllz,^))]. 

I □1^-00 |D|^oo 

Hence, the second term in (5.9) can be estimated using (5.1): 

E[v(n L (zi))]-E[v(n)] < 


1^1“ ’ 


E[v(n e (z'))]-E[v(n)]< 


!□* 


and similarly, 
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By independence, the first term in (5.9) is equal to 


i 3 d 

-E lo s E 


r . 


i= 1 


□ 


exp [ rX-rz^zp(v(n L (zi)) - E [v(n L ( Zi ))]) 


which by the induction hypothesis is bounded by 


(5.11) 


3 J C 


A r 


PlI 1 -* ' 

ini 1 -^ 


< CrA 2 , 


since [3 < 1/2 and 3 rf |D< |D|. In order to estimate the hrst term in (5.10), we 
split the set < N} into 3 d subsets { z'- , j e Z±}, ..., { z[ } , j e Z 3 d} in such 

a way that if j i e Z ^ and j 2 e Z^ 2 with k\ ± k 2 , then the cubes and 

Ui(zj 2 ) are at distance at least i from one another. By Holder’s inequality and 
independence, we get that the hrst term in (5.10) is bounded by 


1 N 

E lo S E 


3 d s 


j =1 


□ 


exp [3 sX—^iviU^)) -E[zy(n^(z'))]) 


which by the induction hypothesis is bounded by 

. 2 


(5.12) 


CIV 

~¥s 


3 d sX 


Inl 1 -^ 


< C3 s 


^ 1 ° 


!□! 


£| a 2 . 


To sum up, we have shown that 

logE [exp (A|m|^ {y{u) -£[!/(□)]))] < C (r + 3 d s^j^) A 2 + Acyj^L 


□/ 


(recall that 3 d |Di| + iV|Lb| = |U|), where r, s e (l,oo) such that l/r+ 1/s - 1 are 
arbitrary. Recall that t = 3 7 " 1 with 7 € (A, 1). Since /3 < a, we can choose 7 
sufficiently close to 1 that 

|n|0 


!□< 


= 3‘ 


for some e > 0. Moreover, there exists a constant c d such that N < Q3 m B 7 h d B, 
hence 

logE [exp (A|mp (V(n) - E[R(D)]))] < C (r + SQ3 d 3 _m(1_7) ) A 2 + Ac3- me . 

We can now choose s = 3 m ( 1 _ T ')/2 anc [ thus obtain that for every A > 0, 


(5.13) logE [exp (A|□ | /3 (/ 2 (D) - E[z/(U)]))] 

< C ((1 - 3 -™^)/*)-! + c d 3^ (1 -7)/ 2 ) A 2 + Xc3~ m£ . 

The same reasoning applied to -jl shows that for every A < 0, 

(5.14) logE [exp (Xpf (//(□) - E[p(D)]))] 

< C ((1 - 3~™(l- 7 )/2)-l + Cd3 d-m(l- 7 )/2) A 2 _ \c 3 ~ m£ . 

Moreover, by (5.1), we can replace E[/x(lH)] by E[z7(Ll)] in (5.14) provided we 
replace c by 2c in the right side of (5.14). Since v > Jl, we deduce that for every 
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A e M, 

logE [exp (A|D| /3 (p(Q) - E[z7(Ll)]))] 

< C ((1 - 3-™(l-7)/2)-l + Cd sd-m( l-7)/2 j A 2 + 2 |A| C 3- m£ . 

The result for v follows by Lemma 5.3 and similar reasoning applies to Jl. □ 

We now complete the proof of Theorem 1.1 by combining Proposition 4.1 
and Theorem 5.1. 

Proof of Theorem 1.1. Let a < a' < 1/d, p e M. d be a unit vector, and cj = a p. 
By Proposition 4.1, there exists c < oo such that for every □ e C, 

(5.15) E[u(a,p)] -E[p(D,g)] -p-q < pp. 

We apply Theorem 5.1 with V = v(-,p) and ft = fi(- ,q) + p ■ q. The mea¬ 
surability assumption on v and Jl and the property that v < Jl clearly hold. 
Assumption (5.2) is also satisfied, since v and Jl are bounded. We thus obtain 
the existence of a constant C < oo such that for every cube □ e C and A e M, 

logE[exp(A|n| a (z/(D,p) - E[z/(D,p)]))] < CA 2 . 

Moreover, the constant C does not depend on the unit vector p (since the same 
is true of the constant c in (5.15)). By Corollary 4.2, we obtain, for every A el, 

< C{\ + A 2 ). 

The extension to arbitrary p e then follows by homogeneity. The reasoning 
for /i is identical. □ 

6. SUBLINEAR GROWTH OF THE CORRECTORS 

In this section we prove Theorem 1.2. The proof naturally divides into two 
main steps: first we introduce a functional inequality, which appears to be 
new and which we term the multiscale Poincare inequality. It converts control 
of spatial averages of gradients into control over the function itself. Then we 
show, using Theorem 1.1, that estimates of spatial averages of the gradient 
of the correctors can be reduced to the convergence of the subadditive energy 
quantities. 

In this section it is convenient to work with triadic cubes, so we change the 
notation from the rest of the paper: for every meN, 

6.1. Multiscale Poincare inequality. Here we present an inequality which 
gives an estimate of the iL _1 norm of Vw in terms of spatial averages of Vu in 
cubes. This can be seen as a generalization of the usual Poincare inequality 
giving the bound, for every iteLf 1 (D m ), 

(6.1) -f \u(x) - (u) n I dx < C(d)3 2m -f |Vw(x)| 2 dx. 

Jam Jam 


logE 


exp 


(a|q| c 


K q >.p) -^p-^p 
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The sharpness of the scaling of the constant C 3 2m in the Poincare inequality 
is, of course, realized by considering an affine function. In the following propo¬ 
sition, we show that this scaling can be improved for functions with gradients 
having small spatial averages relative to their absolute size: roughly, if the 
gradient is canceling itself out, then the function has smaller oscillation. 

Proposition 6.1 (Multiscale Poincare inequality). Fix me N and, for each 
n eN, n < m, define Z n 3 n 7L d n O m . There exists a constant C(d) < oo such 
that, for every u e H l {n m ), 


(6.2) |w-(w) 


□m ll_L 2 (D m ) 


+ Vm L-i 


tr (Dm) 


m —1 / 

scivuiL^ + cis" [\z n r y |(v«) 

n =0 


yzZ n 


y+O n 


Proof. We first prove the estimate for || Vu||_H-i( Dm ) and then deduce the esti¬ 


mate for \\u - (u) 


□m II L 2 (D m ) 


as a simple consequence. Without loss of generality, 


we may suppose (u) 0m = 0 . 

Step 1. The estimate for ||Vu||j i -i( nm ). Fix rj e H 1 (n m ]'R. d ) with 


f |W7(*)f 


dx = 1. 


We denote by w e the unique (up to additive constants) solution of 

the Neumann problem 

f - Aw = -V -r) + b in D m , 

[ d v w - 0 on dn m , 

where b -f D V • rj(x ) dx is chosen to ensure solvability. Then according to [16, 
1], we have w e H 2 (n m ) and 

(6.3) -f \v 2 w(x)\ 2 dx < C J- |V? 7 (x )| 2 dx - C. 

dm dm 

Testing the equation for w with u and using (u) Dm = 0 yields 

(6.4) 4 - V«(i) • r](x) dx - + Vu(x) ■ Vw(x) dx. 

'-'dm ^ dm 

For every n e {1,..., m} and z e Z n , we have 
/ Vu(x)-(vw(x)-(Vw) z+a J dx 

J Z+On 

E [ V«(i)-(Vw(x)-(Vw) 9+Dn J 

, nfsi+ml Jy+On -1 


dx 


y^^n—1 n(2+Dn) 

+ 


n ™-il E ((^ 7 ' u; )2+n„ 

yeZ n -ir(z+nn ) 

By the Poincare inequality, 

E I(Vw) wDb - (^) y+ a n J < C3 2n f |vMx )| 2 dx. 

yzZ n -m(z +□„) Jz+U n 
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Therefore, after summing over z e Z n and using Holder’s inequality, we get 

E / Vu(i) ■ (Viu(i) - (Vu;) (+D J dx 

ztZ n Oz+D„ 

^ E f Vu(x)-(Vw(x)-(Vw) y+nn J dx 

ytZn-1 J V+ a n -1 


+ C 3 ”<‘*W/' | v 2 „ W |^AT y | ( vu) 

' \yeZ n -1 

Iterating this and using (6.3), 

/ Vu(x) ■ X7w(x) dx < ^ / Vu(x) ■ (Vw(x) - (V'R') z+Do ) dx 
Jam Z tz 0 Jz+o 0 


i m_1 / 

+ C'|n m |5 £ 3 n(l+d/2) | £ |( Vw ) 
71=0 


\y^Z n 


2 /+Dn 


By the Poincare inequality and (6.3), 

E f \Vw(x)-(Ww) z+ao \ 2 dx<C f \Vrj(x)\ 2 dx = C\n m \. 

ZtZ 0 Jz+ °0 J Dm 

Thus using Holder’s and Young’s inequalities, we obtain 
/ Vu(x) • Vu>(x) dx 

Dm 

<C|D m |^( [ \Vu(x)\ 2 dxY +C\n m \^ ^ 3 -(i+d/ 2 )/ £ |( Vw ) 

/ n-n \yeZ n 


y+a„ 


Using (6.4) and rearranging the inequality yields the desired conclusion after 
taking the supremum over all such rj. 

Step 2. The estimate for |MlL 2 (n m )- By the representation theorem, there exists 
0 e L 2 {U m ) such that / n |0(x )| 2 dx - 1 and 

1 

(6.5) (-f \u(x)\ 2 dx\ = -f u(x)4>(x) dx - J- u(x) {(/)(x) - (0) Dm ) dx. 

/ -J Dm 'J Dm 

Denote by w e H 2 (\J m ) the unique (up to additive constants) solution of the 
Neumann problem 

|-Aw = 0 ~( 0 ) Dm in □„», 

| d u w = 0 on dn m . 

We have that w e H 2 {U m ) and 

-f | S/ 2 w(x)\ 2 dx < C -J- \4>(x) - (0) Dm | 2 dx < C. 

Testing the equation for w with u yields that 

J - u(x) (0(x) - (0) Dm ) dx - -j- Vm(x) • Vu’(x) dx. 

DDm ^ Dm 

The conclusion now follows from the previous step and (6.5). □ 
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6.2. Proof of Theorem 1.2. Throughout this subsection, we let 

v(x, U,p) ■= -u(x, U,p, 0) - p ■ x. 

That is, v(-. U,p) e Hq(U) is the minimize!' in the definition of u(U,p) with the 
plane x >-» p ■ x subtracted. 

The multiscale Poincare inequality motivates us to prove the sublinearity of 
the correctors by studying the spatial averages of its gradient in mesoscopic 
cubes. This is accomplished by a very simple energy comparison argument 
combined with the Lipschitz estimate, which reduce the needed estimates to 
the convergence of the subadditive quantities. The statements we need are 
given in the following two lemmas. 

In the first lemma, we compare Vu(-,D m ,p) in the large cube U m to the 
gradient of the function obtained by gluing together the functions u(-, z + U m ,p) 
on the mesoscopic subgrid {z + U n : z e Z n } and bound the difference in terms 
of the cell problem energies in these cubes. 

Lemma 6.2. For every peR d and m,n e N with m>n, 

(6.6) \Z n \~ L Y \\Vv(-, a m,p)-Vv(-,z + n n ,p)\\l2( z+an) 

z£Z n 

< -v(n m ,p) + \z n \~ x Y v(z + n n ,p), 

Z^.Zn 

where Z n := 3 n 7L d n D m . 

Proof. The lemma is a simple consequence of the uniform convexity of the 
energy functional and a proof can be found in [4, Lemma 2.1]. For the reader’s 
convenience, we also provide the argument here. Define V e !/,((□,„) to be 
the function obtained by gluing together the functions v(-,z + □„,£>) for z e 
3 n Z d n n m . In other words, V = v(-, z + U n ,p) in z + for each z € 3 n Z d n \J m . 
Since x p ■ x + v(x, O m ,p) is a solution of (1.1), we have 

(a(p + Vu(-,D m ,p)) ,VP) Dm =0. 

Using this and a direct computation, we find that 

i|vu(-,n m ,p)-vu|| 2(Dm) 

— 2 ^rniP) _ Vh, a (Vu(’, Dmjp) — Vh ))n m 

^(p + VF,a(p + VF)) 0m - ^(p +Vu(x,n m ,p),a(p+Vu(x,n m ,p))) Dm 
= \\ZnV l Y v(z + n n ,p)\-v(n m ,p). 

This completes the proof. □ 

We now use the Lipschitz estimate to upgrade the previous estimate. This 
will give us uniform local control as we blow up the macroscopic cube to 
the whole space and thereby yield information on the stationary correctors. 
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Lemma 6.3. Let X be as in the statement of Theorem 2.1. There exists 
C(d,A ) < oo such that, for every p e M. d , m,n,k e N with m>n>k and X < 3 k , 
we have the estimate 

(6.7) \\Vv(-,n m ,p) -s/v(-,n n ,p)\\ L 2 {Dk) 

1 

m— 1 / \ 2 

^ C Y, ( -^(Dt+i, p) + 3~ d Y v( z + n hP)\ ■ 

l=n \ zi3 l 7. d na i+1 ) 


Proof. Fix k e N with X < 3 k . By Theorem 2.1 (here we just use the Lipschitz 
estimate) and the previous lemma, for every l e N, l > k, we have 

II Vu(-, □«+! ,p) - Vv(-, D h p) || 2(Dfc) 

<C\\Wv(-,a l+1 ,p) - Vv(-,n h p)Wl^a,) 

<c[-u{U U i,p)+3~ d Y v(z + ni,p)\. 

\ 26 3 ( Z d nD i+1 / 


Note that the first inequality in the display above is obvious if l = k, and is a 
consequence of Theorem 2.1 for l > k + 1. Taking the square root, summing 
over l - n,... ,m - 1 and using the triangle inequality yields (6.7). □ 


We next combine the previous two inequalities with the multiscale Poincare 
inequality to obtain estimates on the corrector itself in a large macroscopic 
cube terms of u in triadic subcubes. 

Lemma 6.4. Let X be as in the statement of Theorem 2.1. Then, for every 
meN with 3 m > X, 

(6.8) |$(-,p) - ($(-,p)) Dm \\ L 2 (Dm) + |V$(-,p)L-i (Dm) 

1 

771-1 / \ 2 

< C\p\ + C Y 3 n I -K n m,p) + I ZnY 1 Y U ( z + °n,p) I 

n =0 \ ztZ n / 

+ C3 m y (-K D i+i, p ) + 3_d Y v( z + a hp)] ■ 

l=m \ ze3 l Z d nOi+i J 

Proof. Letting m-*■ oo in (6.7) yields that, for every me N with 3 m > X, 


(6.9) 


II - Vv(-, □ m ,p)|| i i (Dm) 

OO / 

-p(n«+i,p)+ 3_d Y v( z + n hP) 

l=m \ zeS l Z d nDi+i 


1 

2 
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For n e N with n < m, we use the fact that v(-, z + U n ,p) e Hq(z + □„) and use 
integration by parts, the Holder inequality and Lemma 6.2 to get 

\^n\ XI |(Vn(-, 

yzZn 

< \Zn\- 1 E |vn(-,n m ,p) - vv(-,z + n n ,p)\\l 2(z+Dri) 

z^.Zn 

< cl -v(n m ,p) + \Z n \~ l E v(z + \3 n ,p )). 

\ zaZ n ) 

Applying Proposition 6.1, we obtain 

\\v(-,a m , P ) ||^ 2 (Dm) + \Wv(-,n m ,p)\\ K -i (am) 

1 

m_1 / 1 , .o\ 2 

<C'||vn(-,n m ,p)||^ (Dm) + C'E3 n \\ZrY 1 E |(Vn(-,n m ,p)) y+D J 

n=0 \ ye.2y / 

i 

m-l / \ 2 

< C|p| + C E 3 n I + l^np 1 E + Q n,p) j • 

n =0 \ / 

By the previous display, (6.9), the triangle inequality and the fact that the L 2 
norm is stronger than the LA 1 norm, we get 


IIV$( - ,p)|f/ _ i (nm ) < |Vn(-,n m ,p)| i£ -i (Dm) + |V < h(-,p) - Vn(-,n m ,p)||^-i (Dm) 

< |Vn(-,D m ,p)| il -i (nm) + C|vd>(-,p)- Vn(-,n m ,p)|i2 (Dm) 

1 
2 


m-l 

< C\p\ + C E 3 n ( -v(U m ,p) + \ZrJT 1 E V ( Z + Q n,p) 

n=0 


z&Z n 


+ C3 m E ( -K D I+I,p) + 3 d E K 2 + D i.p) 

zeS^^nDi+i 


l=m 


This is the desired estimate for ||V$(-,p)|| i? -i( nm ). To complete the proof 
of ( 6 . 8 ) and obtain the estimate for ||$(x,p) - ($(-,p)) l=u || £; 2( n y we argue 
similarly, but there is an extra step in which we use the Poincare inequality: 

\\$(x,p) -($(-, p)) a J L Hn m ) 

< 2 ||n(-,n m ,p)||^ 2 (Dm) + ||$(t,p) -v(-,n m ,p) - ($(-,p) -n(-,n m ,p)) Dm ||^ 2(nm) 

1 

< C \\v(-,a m ,p)\\ L \am) + C3m [ { |v$(x,p) - Vv(x,a m ,p)\ 2 dx) 

\ 'J dm / 

1 

m-l / \ 2 

< C\p\ + C E 3 n -u(n m ,p) + I Z n [ l E u ( z + a n,p) 


Z^Zyi 


+ C3 m E -v(p ul) p) + 3~ d E K* + °i,p) 


□ 


l-m 


ze 3 l z d na u 


We are now ready to complete the proof of Theorem 1.2. 
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Proof of Theorem 1.2. By standard comparisons, it suffices to establish the 
result with B R replaced by U m , me N. We may also assume |p| < 1 by homo¬ 
geneity. Fix /3 e (0, |) and define, for each n e N, 


3 ( m n ) ( -v(U m ,p) + \Z n \ Y u ( z + a n,p) if n < m - 1, 


2j y-) 


Sn. < 


-v(a n+ll p) + 3 d Y v(z + n ni p) 

26 3"Z d nm„ + i 

By Lemma 6.4, the theorem is proved if we can show that 

\ 2 \ 


if n > m. 


logE 


exp 


[ A3 2m/3 ( 




£ 

\n =0 


< (7(1 + A 2 ). 


We let (3' e (/3,1/2), and apply Jensen’s inequality with respect to the measure 
£n=o 3~ n ^'-^5n and the convex function x >-» exp (a; 2 ) to get 

\ 2 \ 


exp I A3 2m/? s n ) I < C Y 3“ n(/5 ' _/3) exp (\3 2m ^ +2n ^'-^s 2 n ) . 

\ \n=0 / / n =0 

We analyse hrst the sum over n>m. By Theorem 1.1 with a = 2(3'/d, 

oo oo 

Y 3‘ n(/3 '-' 3) E [exp (A3 2m ^ +2n( ^'~^ ) s 2 )] < Y 3“ n(/3 '^ } exp [C (l + 3 -^^) A 2 )] 

n-m n-m 

< exp [(7(1 + A 2 )]. 

For the sum over n < m, we use Jensen’s inequality and Theorem 1.1 to get 


logE 

Therefore 


exp 


XS 2nP' 


-p ■ ap - \Z n \ Y u ( z + a n,p) 

^ ztZ n 


<C{ 1 + A 2 ). 


m— 1 

Y 3“ n(;5 ' _/3) E [exp (A3 2m/3+2n(/3 '- /3) s 2 )] 

n=0 

m-1 

< X! 3- n(/3 '-^ exp [0(1 + 3 _2 ( 1 -^)( m_n ) A 2 )] < exp [C (l + A 2 )]. 

n=0 

This completes the proof. □ 
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